Abstract. The Selberg sieve provides majorants for certain arithmetic
sequences, such as the primes and the twin primes. We prove an
$L^2$-$L^p$ restriction theorem for majorants of this type. An immediate
application is to the estimation of exponential sums over prime
$k$-tuples. Let $a_1, \dots, a_k$ and $b_1, \dots, b_k$ be positive integers.
Write $h(\theta) := \sum_{n \in X} e(n\theta)$, where $X$ is the set of all $n \le N$
such that the numbers $a_1 n + b_1, \dots, a_k n + b_k$ are all prime. We obtain
upper bounds for $\|h\|_{L^p(\mathbb{T})}$, $p > 2$, which are (conditionally on the
Hardy-Littlewood prime tuple conjecture) of the correct order of
magnitude. As a second application we deduce from Chen's theorem,
Roth's theorem, and a transference principle that there are
infinitely many arithmetic progressions $p_1 < p_2 < p_3$ of primes,
such that $p_i + 2$ is either a prime or a product of two primes for
each $i = 1, 2, 3$.
1. Introduction and statement of results

The exponential sum over the primes$^1$

    $$h_N(\theta) := \sum_{p \le N} e(p\theta)$$

has been very extensively studied. Here of course $e(\theta) := e^{2\pi i\theta}$. Since the
time of Hardy and Littlewood such sums have been treated by dividing the
circle $\mathbb{R}/\mathbb{Z}$ into the major arcs, which is to say points near a rational with
small denominator, and the complement of this set, the minor arcs.

The sum $h_N(\theta)$ (or at least the very closely-related exponential sum over
the von Mangoldt function) is discussed in [36, Ch. 3], for example. There
it is shown how to use information concerning the distribution of primes
in arithmetic progressions in order to get an asymptotic for $h_N(\theta)$ on the
major arcs. On the minor arcs, $h_N(\theta)$ is shown to be rather small by an
entirely different method, also described in [36, Ch. 3]. One consequence
of these results is that for any $p > 2$ one has

(1.1) $\|h_N\|_p := \Big( \int_0^1 |h_N(\theta)|^p \, d\theta \Big)^{1/p} \ll_p N^{1-1/p} / \log N.$

Disregarding the multiplicative constant, (1.1) is easily seen to be tight,
this being a reflection of the prime number theorem, which in particular
gives $|h_N(\theta)| \sim N/\log N$ when $|\theta| \ll 1/N$. In fact, one could estimate
$\|h_N\|_p$ asymptotically.

In this paper we consider generalizations of this exponential sum in which
the primes are replaced by such sets as the twin primes. More precisely, let
$F = \prod_{j=1}^{k} (a_j n + b_j)$ be a polynomial which is a product of $k \ge 1$ linear factors
with integer coefficients. We assume that $a_j \ne 0$ and that the discriminant

    $$\Delta := \prod_{1 \le i < j \le k} (a_i b_j - a_j b_i)$$

is non-zero, or in other words that no two linear factors are rational
multiples of each other. For any such $F$, we consider the set

    $X = X(F) := \{ n \in \mathbb{Z}^+ : F(n) \text{ is the product of } k \text{ primes} \}.$



Footnote 1. The letter $p$ will be used both for primes and for exponents in Hölder's inequality; there
should be no danger of confusion. All summations over $p$ are over the primes, and summations
over other variables are over the positive integers unless otherwise indicated. Most of the
notation in the introduction is very standard; in the event of further clarification being required,
consultation of [...] is recommended.
For example, if $F(n) := n(n + 2)$, then $X$ is the collection of twin primes.
For any $q \ge 1$, we introduce the quantity $\gamma(q) = \gamma(q, F)$ defined by

(1.2) $\gamma(q) := q^{-1} |\{ n \in \mathbb{Z}/q\mathbb{Z} : (q, F(n)) = 1 \}|.$

Observe that $0 \le \gamma(q) \le 1$. This number represents the proportion of
residue classes modulo $q$ which could hope to contain an infinite number of
elements of $X$. We will impose the natural non-degeneracy condition

(1.3) $\gamma(q) > 0$ for all $q \ge 1$,

since $X(F)$ is clearly finite if this condition fails.

Remark. Note that to verify (1.3) it suffices by the Chinese remainder
theorem to verify it for prime $q$, and in fact one only needs to verify it for
finitely many prime $q$, as the claim is trivial if $q$ is larger than both $k$ and
the discriminant $\Delta$ of $F$. Indeed for primes $p$, we have

(1.4) $\gamma(p) = 1 - k/p$ whenever $p > k$ and $p \nmid \Delta$.
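
To make the definition (1.2) concrete, here is a minimal Python sketch that computes $\gamma(q)$ by brute force over the residue classes modulo $q$. The function name `gamma` and the encoding of $F$ as a list of coefficient pairs $(a_j, b_j)$ are illustrative choices of ours and do not come from the text. For the twin prime polynomial $F(n) = n(n+2)$ one has $k = 2$ and $\Delta = 2$, so (1.4) predicts $\gamma(p) = 1 - 2/p$ for every prime $p > 2$.

```python
from fractions import Fraction
from math import gcd


def gamma(q, forms):
    """Return gamma(q) as in (1.2) for F(n) = prod (a*n + b) over (a, b) in forms.

    This is a brute-force count over all residues n mod q; it is only meant
    for small q.
    """
    count = 0
    for n in range(q):
        value = 1
        for a, b in forms:
            value = (value * (a * n + b)) % q
        # gcd(q, 0) == q, so residues where F(n) = 0 mod q are rejected
        # (except in the trivial case q == 1).
        if gcd(q, value) == 1:
            count += 1
    return Fraction(count, q)


# Hypothetical example data: the twin prime polynomial F(n) = n(n + 2).
twin = [(1, 0), (1, 2)]

if __name__ == "__main__":
    for q in (1, 2, 3, 5, 7, 15):
        print(q, gamma(q, twin))
```

The multiplicativity used in the Remark (reduction to prime moduli via the Chinese remainder theorem) can be observed directly: `gamma(15, twin)` equals `gamma(3, twin) * gamma(5, twin)`.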

The exponential sum over the twin primes, or more generally exponential
sums of the form

(1.5) $h(\theta) = h_{N,F}(\theta) := \sum_{n \le N,\ n \in X(F)} e(n\theta),$

have not received much attention. One very good explanation for this is
that, for a fixed choice of $F$, the set $X(F)$ is not even known to be
non-empty. Thus there is certainly no hope of obtaining an asymptotic formula
for $h$ on the major arcs, unless one is willing to obtain results on average
over choices of $F$, as was done in [...]. On the other hand, the
Hardy-Littlewood $k$-tuple conjecture asserts that for a fixed $F$

(1.6) $h_{N,F}(0) = |X(F) \cap [1, N]| = (1 + o_F(1)) \, \mathfrak{S}_F \, \frac{N}{\log^k N}$

as $N \to \infty$, where $\mathfrak{S}_F$ is the singular series

(1.7) $\mathfrak{S}_F := \prod_p \frac{\gamma(p)}{(1 - 1/p)^k}.$

From (1.4) we see that the infinite product in (1.7) is absolutely convergent.
We will prove certain facts about $\mathfrak{S}_F$ later on, but to give the reader a feel
for it we remark that if $|a_j|, |b_j| \le N$ then $\mathfrak{S}_F \ll_{k,\varepsilon} N^{\varepsilon}$. Also, the $l^p$
average of $\mathfrak{S}_F$ over any sensible family of $F$ (for example $F(n) = n(n + h)$,
$h = 1, \dots, N$) will be absolutely bounded.
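
For a feel of the size of the singular series, the following Python sketch evaluates a truncation of the Euler product $\prod_p \gamma(p) (1 - 1/p)^{-k}$ over the primes up to a cutoff. The helper names and the cutoff are our own assumptions for illustration; the product form is the standard one for the Hardy-Littlewood conjecture. For the twin prime polynomial $F(n) = n(n+2)$ the full product is twice the twin prime constant, roughly 1.3203.

```python
def primes_up_to(limit):
    """Sieve of Eratosthenes: all primes p <= limit."""
    sieve = [True] * (limit + 1)
    sieve[0:2] = [False, False]
    for i in range(2, int(limit ** 0.5) + 1):
        if sieve[i]:
            for j in range(i * i, limit + 1, i):
                sieve[j] = False
    return [i for i, is_prime in enumerate(sieve) if is_prime]


def gamma_prime(p, forms):
    """gamma(p) for a prime p: proportion of n mod p with no factor a*n + b = 0 mod p."""
    good = sum(
        1 for n in range(p) if all((a * n + b) % p != 0 for a, b in forms)
    )
    return good / p


def singular_series_partial(forms, limit):
    """Truncated Euler product over primes p <= limit (an approximation only)."""
    k = len(forms)
    product = 1.0
    for p in primes_up_to(limit):
        product *= gamma_prime(p, forms) / (1.0 - 1.0 / p) ** k
    return product


if __name__ == "__main__":
    twin_forms = [(1, 0), (1, 2)]  # hypothetical example: F(n) = n(n + 2)
    print(singular_series_partial(twin_forms, 2000))
```

Because each factor is $1 + O(p^{-2})$ for large $p$, the truncation error decays quickly, which reflects the absolute convergence noted above.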



The conjecture (1.6) is still very far from being settled; however it is well
known that sieve methods (either the Selberg sieve or the combinatorial
sieve) give upper bounds of the form

(1.8) $|X(F) \cap [1, N]| \ll_k \mathfrak{S}_F \, (1 + o_F(1)) \, \frac{N}{\log^k N}$

for fixed $F$ (see e.g. [19, Theorem 5.3]). This upper bound only differs
from (1.6) by a multiplicative constant depending on $k$. As an immediate
consequence of this bound and Parseval's identity one may place an upper
bound on $\|h_{N,F}\|_2$ which (conditionally on the $k$-tuple conjecture) is of the
correct order of magnitude. Our first result (proven at the end of §4) asserts
that we in fact can obtain good estimates for all $L^p$ norms, $p > 2$, of these
exponential sums:

Theorem 1.1. Let $F$ be a product of $k$ integer linear forms having non-zero
discriminant, obeying the non-degeneracy condition (1.3) and with
coefficients $a_i, b_i$ satisfying $|a_i|, |b_i| \le N$. Let $h_{N,F}$ be the exponential sum over
$X(F) \cap [1, N]$ as defined in (1.5) above. Then for every $2 < p < \infty$ we have
the estimate

    $$\|h_{N,F}\|_p \ll_{p,k} \mathfrak{S}_F \, N^{1-1/p} (\log N)^{-k}.$$

The implied constant depends on $k$ but is otherwise independent of $F$; it is
explicitly computable.

Remark. In the converse direction, if the Hardy-Littlewood prime $k$-tuple
conjecture (1.6) is true, then one easily obtains the lower bound

    $$\|h_{N,F}\|_p \gg_{p,k} \mathfrak{S}_F \, N^{1-1/p} (\log N)^{-k},$$

at least for fixed $F$ and $N$ sufficiently large. This is a simple consequence of
(1.6) and the observation that $|h_{N,F}(\theta)| \ge \frac{1}{2} |X(F) \cap [1, N]|$ if $|\theta| \le c_k/N$ and
$c_k > 0$ is sufficiently small. However, when $p = 2$, we see from Parseval's
identity and the prime $k$-tuple conjecture (1.6) that

    $$\|h\|_2 = |X(F) \cap [1, N]|^{1/2} \gg_k \mathfrak{S}_F^{1/2} N^{1/2} (\log N)^{-k/2}$$

and so Theorem 1.1 is expected to fail logarithmically at the endpoint $p = 2$.

The proof of Theorem 1.1 proceeds not by studying the set $X(F) \cap [1, N]$
directly, but rather by constructing a slightly larger object $\beta_R$, which we call
an enveloping sieve for $X(F) \cap [1, N]$. This sieve function is considerably
easier to analyze than $X(F) \cap [1, N]$ itself, and in particular enjoys very
good Fourier properties. It then turns out that the only property we need
on $X(F) \cap [1, N]$ itself to deduce Theorem 1.1 is that it is majorized by this
sieve. In particular, this theorem will also hold if the set $X(F)$ is replaced
by other similar sets, for instance if primes are replaced by almost primes,
or with any subset of the almost primes. Combining this observation with
an argument of the first author [16], we prove the following result.
Theorem 1.2. Define a Chen prime to be a prime $p$ such that $p + 2$ is
either prime or a product of two primes. Then there are infinitely many
3-term arithmetic progressions of Chen primes.

Remark. Such a theorem, with 3 replaced by a more general integer $k$,
can probably be obtained by adapting the methods of [18], in which the
primes were established to contain arithmetic progressions of any length $k$,
though the explicit constants obtained by those methods are likely to be
substantially worse. Our techniques here are somewhat similar in spirit to
those in [18], in that they are based on establishing a transference
principle, in this case from Roth's theorem [29] concerning progressions of length
three in a dense subset of the integers, to the corresponding result
concerning dense subsets of a sufficiently pseudorandom enveloping set. In [18]
this transference principle was established (under quite strong
pseudorandomness assumptions) by the finitary analogue of ergodic theory methods.
Here, we use a much simpler harmonic analysis argument to prove the
transference principle required for Theorem 1.2, and as such we require a
much less stringent pseudorandomness condition (basically, we require the
Fourier coefficients of the enveloping sieve to be under control).

Remark. We note that results along the lines of Theorem 1.2 can be
approached using sieve methods and the Hardy-Littlewood circle method
in a more classical guise. For example, Tolev [...] showed that there are
infinitely many 3-term progressions $p_1 < p_2 < p_3$ of primes such that $p_i + 2$
is a product of at most $r_i$ primes, where $(r_1, r_2, r_3)$ can be taken to be
$(5, 5, 8)$ or $(4, 5, 11)$.

The authors would like to thank the anonymous referee for a particularly 
attentive reading of the paper, and John Friedlander for a number of helpful 
remarks concerning the material in the appendix. 

2. Notation 

We take the opportunity to introduce here some basic notation common 
to the whole paper, most of which is standard in analytic number theory. 
Some of our notation, such as the use of the expectation symbol E, is less 
standard in this field but is very convenient for certain additive problems 
of the type considered here. This is because there are ideas from ergodic 
theory bubbling just beneath the surface. 

Let $k$ be a parameter. We write $A \ll_k B$ or $A = O_k(B)$ to denote the
estimate $A \le C_k B$ where $C_k > 0$ is a constant depending only on $k$. The
notation $o_k(1)$ will be reserved for a quantity which tends to zero as the
variable $N$ tends to $\infty$, at a rate which may depend on $k$ (and possibly
other subscripted parameters or objects).
For a statement $P$, we will occasionally write $1_P$ to denote the indicator
of $P$, thus $1_P = 1$ if $P$ is true and $1_P = 0$ if $P$ is false. If $A$ is a set,
we use $1_A$ to denote the function $1_A(x) := 1_{x \in A}$. We will always write
$|A| = \sum_x 1_A(x)$ for the cardinality of $A$.

We write $\mathbb{Z}_q$ for the cyclic group $\mathbb{Z}/q\mathbb{Z}$, and $\mathbb{Z}_q^* := \{ a \in \mathbb{Z}_q : (a, q) = 1 \}$
for the multiplicatively invertible elements of $\mathbb{Z}_q$. We will often identify $\mathbb{Z}_q$
with $\{0, \dots, q - 1\}$ or with $\{1, \dots, q\}$ in the obvious manner.

As we remarked, it is convenient to work with the language of measures
and conditional expectation in much the same way as the authors did in
[18]. If $f : A \to \mathbb{C}$ is a function and $B$ is a non-empty finite subset of $A$,
we write $\mathbb{E}_{x \in B} f(x)$ for the average value of $f$ on $B$, that is to say

    $$\mathbb{E}_{x \in B} f(x) := \frac{1}{|B|} \sum_{x \in B} f(x).$$

For the purposes of this paper this subscript notation is more convenient
than the more traditional notation $\mathbb{E}(f(x) \mid x \in B)$ of probability theory. In
many cases we shall just have $B = A$, in which case we abbreviate $\mathbb{E}_{x \in A} f(x)$
as $\mathbb{E}(f)$.

As remarked earlier, we write $e(\theta) = e^{2\pi i\theta}$. If $m$ is a positive integer then we
introduce the related notation $e_m$ for the additive character on $\mathbb{Z}_m$ defined
by $e_m(x) = e(x/m)$.

3. The enveloping sieve 

We now begin the proof of Theorem 1.1. The strategy of the sieve method is
not to study the set $X \cap [1, N]$ directly, but rather to construct an enveloping
sieve $\beta_R : \mathbb{Z}^+ \to \mathbb{R}^+$ which is concentrated on $X \cap [1, N]$ and for which one
has satisfactory control of the Fourier coefficients of $\beta_R$. We will normalize
$\beta_R$ to have average value comparable to 1; from (1.6), we would thus expect
$\beta_R$ to roughly be of size $\log^k N / \mathfrak{S}_F$ on $X \cap [1, N]$, and this indeed will be the
case; indeed, $\beta_R$ will be a pointwise majorant of (most of) the normalized
indicator function $c_k \mathfrak{S}_F^{-1} \log^k N \, 1_{X \cap [1,N]}$.

Henceforth the polynomial $F$ will be fixed. We shall replace the set $X =
X(F)$ by "localized" versions $X_q$ for various moduli $q$, defined by

    $X_q := \{ n \in \mathbb{Z}_q : (q, F(n)) = 1 \}.$

This set has already appeared in (1.2). Observe that when reduced (mod $q$)
all the elements in $X$, with at most $kq$ exceptions, lie in $X_q$, the exceptions
arising from the cases when $a_j n + b_j \in \{1, \dots, q\}$ for some $1 \le j \le k$.
We can now describe the important properties of the enveloping sieve, which
is intended to majorize the set

    $X_{R!} = \{ n \in \mathbb{Z} : (d, F(n)) = 1 \text{ for all } 1 \le d \le R \}$

where $R$ is a large integer (the sieve parameter) to be chosen later.
Eventually we will set $R := \lfloor N^{1/20} \rfloor$.

Proposition 3.1 (The enveloping sieve). Let $F$ be the product of $k$ integer
linear forms with non-zero discriminant, obeying the non-degeneracy
condition (1.3) and with coefficients $a_i, b_i$ satisfying $|a_i|, |b_i| \le N$. Let $R \le N$
be a large integer. Then there is a non-negative function $\beta := \beta_R : \mathbb{Z} \to \mathbb{R}^+$
with the following properties:

(i) (Majorant property) We have

(3.1) $\beta(n) \gg_k \mathfrak{S}_F^{-1} \log^k R \; 1_{X_{R!}}(n)$

for all integers $n$. In particular, $\beta(n)$ is non-negative.

(ii) (Crude upper bound) We have

(3.2) $\beta(n) \ll_{k,\varepsilon} N^{\varepsilon}$

for all $0 < n \le N$ and $\varepsilon > 0$.

(iii) (Fourier expansion) We have

(3.3) $\beta(n) = \sum_{q \le R^2} \sum_{a \in \mathbb{Z}_q^*} w(a/q) \, e_q(-an),$

where $w(a/q) = w_R(a/q)$ obeys the bound

(3.4) $|w(a/q)| \ll_{k,\varepsilon} q^{\varepsilon - 1}$

for all $q \le R^2$ and $a \in \mathbb{Z}_q^*$. Also we have $w(0) = w(1) = 1$.

(iv) (Fourier vanishing properties) Let $q \le R^2$ and $a \in \mathbb{Z}_q^*$. If $q$ is not
square-free, then $w(a/q) = 0$. Similarly, if $\gamma(q) = 1$ and $q > 1$, then
$w(a/q) = 0$.

Remark. The enveloping sieve $\beta_R$ is essentially a normalised version of
the one used by Selberg in obtaining the upper bound (1.8). It will
be constructed quite explicitly in §[...], where a full proof of Proposition 3.1
will also be supplied. One does not need to know the exact construction in
order to use Proposition 3.1 in applications. In [...] we offer some remarks
which explain the phrase "enveloping sieve", as well as a comparison of $\beta_R$
(in the case $F(n) = n$) and a somewhat different majorant for the primes
which we used in [...].
4. The Hardy-Littlewood majorant property for the enveloping sieve

In Proposition 3.1 we constructed a majorant $\beta_R$ for the set $X_{R!}$ which
enjoyed good control on the Fourier coefficients; indeed, from (3.4) it is
easy to obtain good $l^p$ control on the Fourier coefficients of $\beta_R$ for $p > 2$,
although the bound in (3.4) causes unavoidable logarithmic losses at the
endpoint $p = 2$. When $p = 4, 6, 8, \dots$ one can pass from this to good
control on $h$ by Parseval's formula, and hence by interpolation one can
obtain Theorem 1.1 in the case $p \ge 4$. However this simple argument does
not seem to suffice in the more interesting region$^2$ $2 < p < 4$ because of the
logarithmic failure of the estimate at $p = 2$ remarked on in the introduction.
In this region $2 < p < 4$ (or more generally for $p \notin 2\mathbb{N}$) there are no good
monotonicity properties in $L^p$ of the Fourier transform to exploit; see [24,
p. 144] for a simple example where monotonicity in this sense breaks down,
and [...] for further discussion.

On the other hand, all known examples where monotonicity breaks down
are rather pathological. When the majorant $\beta_R$ enjoys additional Fourier
or geometric properties it is often possible to recover estimates of
Hardy-Littlewood majorant type. This is known as the restriction phenomenon
and has been intensively studied in harmonic analysis; see for instance [32]
for a recent survey of this theory.

The application of ideas from restriction theory to number theory was
initiated by Bourgain in the papers [...] (among others). In [3], which is of
the most relevance to us, it is shown how to obtain bounds for $\|f\|_p$, where
$f$ is a function supported on the primes. Such a bound was also obtained in
a paper of the first author [16], using the Brun sieve and somewhat precise
versions of the prime number theorem for arithmetic progressions. Our
arguments here, while certainly in a similar spirit, differ from the previous
arguments in that we rely purely on the properties of the majorant given
by Proposition 3.1; in particular we do not need any type of control of the
Riemann $\zeta$-function or of related objects.

Proposition 4.1 gives a restriction estimate for $\beta$ in the setting of the finite
abelian group $\mathbb{Z}_N$. The dual form of this estimate, which is Proposition 4.2,
will be of more interest for applications (indeed, Theorem 1.1 will be an
almost immediate consequence of it). As in many arguments in restriction
theory, the argument is based in spirit on the Tomas-Stein method, first
used in [...].

Footnote 2. For the application to finding progressions of length three, we in fact need this estimate for
some $2 < p < 3$.
Proposition 4.1 ($L^q$ restriction estimate for $\beta$). Let $R, N$ be large
numbers such that $1 \ll R \ll N^{1/10}$ and let $k$, $F$, $\beta_R$ be as in Proposition 3.1.
Let $f : \mathbb{Z}_N \to \mathbb{C}$ be arbitrary. Then for every $1 < q < 2$ we have

(4.1) $\mathbb{E}_{1 \le n \le N} \Big| \sum_{b \in \mathbb{Z}_N} f(b) e_N(bn) \Big|^2 \beta_R(n) \ll_{q,k} \Big( \sum_{b \in \mathbb{Z}_N} |f(b)|^q \Big)^{2/q}.$

Proof. We will first examine the left-hand side of (4.1) in the special case
when $f$ is supported on a set $B \subseteq \mathbb{Z}_N$ and $\|f\|_\infty \le 1$. We assert that for
every $\varepsilon > 0$ we have

(4.2) $\mathbb{E}_{1 \le n \le N} \Big| \sum_{b \in B} f(b) e_N(bn) \Big|^2 \beta_R(n) \ll_{\varepsilon,k} |B|^{1+\varepsilon}.$

This statement, though at first sight weaker than (4.1), is actually
equivalent to it. To deduce (4.1) from (4.2), normalise so that

(4.3) $\Big( \sum_{b \in \mathbb{Z}_N} |f(b)|^q \Big)^{1/q} = 1.$

In particular this implies that $|f(b)| \le 1$ for all $b \in \mathbb{Z}_N$. If we then define
the sets $B_j \subseteq \mathbb{Z}_N$ for all $j \ge 0$ as $B_j := \{ b \in \mathbb{Z}_N : 2^{-j-1} < |f(b)| \le 2^{-j} \}$,
then we have

    $$\sum_{b \in \mathbb{Z}_N} f(b) e_N(bn) = \sum_{j \ge 0} \sum_{b \in B_j} f(b) e_N(bn).$$

Also, from (4.3) we have $|B_j| \le 2^{(j+1)q}$. Applying (4.2) to $B_j$ (and with
$f(b)$ replaced by $2^j f(b)$) we have

    $$\Big( \mathbb{E}_{1 \le n \le N} \Big| \sum_{b \in B_j} f(b) e_N(bn) \Big|^2 \beta_R(n) \Big)^{1/2} \ll_{\varepsilon,k} 2^{-j} |B_j|^{(1+\varepsilon)/2} \ll_{\varepsilon,k} 2^{((1+\varepsilon)q/2 - 1)j}$$

for any $\varepsilon > 0$. If we choose $\varepsilon$ sufficiently small, then the exponent $(1+\varepsilon)q/2 - 1$
is negative since $q < 2$. Summing this in $j$ using the triangle inequality
in $L^2_{\beta,N}$, defined as the space of complex sequences $(a_n)_{n=1}^{N}$ together with
the inner product

    $$\langle (a_n), (b_n) \rangle := \mathbb{E}_{1 \le n \le N} \, a_n \overline{b_n} \, \beta_R(n),$$

we see that (4.1) indeed follows (since $q < 2$).

It remains to prove (4.2) which, recall, was to be demonstrated on the
assumption that $\|f\|_\infty \le 1$ and that $f$ is supported on $B$. When $B$ is large,
say $|B| \ge R^{\varepsilon}$, we can use the crude estimate $\beta_R(n) \ll_{k,\varepsilon} R^{\varepsilon^2}$ (for instance)
from (3.2), combined with Parseval's identity to conclude the bound

    $$\mathbb{E}_{1 \le n \le N} \Big| \sum_{b \in B} f(b) e_N(bn) \Big|^2 \beta_R(n) \ll_{k,\varepsilon} R^{\varepsilon^2} |B| \le |B|^{1+\varepsilon}.$$

Thus we may take $|B| \le R^{\varepsilon}$.



We will essentially be doing Fourier analysis on $\mathbb{Z}_N$, but exponentials such
as $e_q(a)$, which arise from the Fourier expansion (3.3), will also appear.
To facilitate this somewhat awkward juxtaposition it will be convenient to
introduce a smooth Fourier cutoff. Let $\psi : \mathbb{R} \to \mathbb{R}^+$ be a non-negative
even bump function supported on the interval $[-1/10, 1/10]$ whose
Fourier transform $\hat\psi(y) := \int_{\mathbb{R}} \psi(x) e(xy) \, dx$ is non-negative everywhere and
bounded away from zero when $y \in [-1, 1]$. Such a function can easily be
constructed, for instance by convolving a non-negative even bump function
supported on $[-1/20, 1/20]$ with itself. Then we have

    $$\mathbb{E}_{1 \le n \le N} \Big| \sum_{b \in B} f(b) e_N(bn) \Big|^2 \beta_R(n) \ll \frac{1}{N} \sum_{n \in \mathbb{Z}} \Big| \sum_{b \in B} f(b) e_N(bn) \Big|^2 \beta_R(n) \hat\psi(n/N).$$

We expand the right-hand side using the Fourier expansion of $\beta_R$, (3.3), to
obtain

    $$\frac{1}{N} \sum_{n \in \mathbb{Z}} \sum_{q \le R^2} \sum_{a \in \mathbb{Z}_q^*} \sum_{b, b' \in B} f(b) \overline{f(b')} \, w(a/q) \, e_N(bn) e_N(-b'n) e_q(-an) \hat\psi(n/N).$$

Now the Poisson summation formula implies that

    $$\frac{1}{N} \sum_{n \in \mathbb{Z}} e_N(bn) e_N(-b'n) e_q(-an) \hat\psi(n/N) = \sum_{m \in \mathbb{Z}} \psi\Big( N \Big( m + \frac{b - b'}{N} - \frac{a}{q} \Big) \Big)$$

and hence by construction of $\psi$

    $$\Big| \frac{1}{N} \sum_{n \in \mathbb{Z}} e_N(bn) e_N(-b'n) e_q(-an) \hat\psi(n/N) \Big| \ll 1_{\| \frac{b - b'}{N} - \frac{a}{q} \| \le 1/N},$$

where $\|x\|$ denotes the distance of $x$ to the nearest integer. Applying this
and (3.4), and bounding the coefficients $f(b)$, $f(b')$ by one, our task thus
reduces to proving that

(4.4) $\sum_{q \le R^2} \sum_{a \in \mathbb{Z}_q^*} \sum_{b, b' \in B :\ \| \frac{b - b'}{N} - \frac{a}{q} \| \le 1/N} q^{\varepsilon/2 - 1} \ll_{\varepsilon} |B|^{1+\varepsilon}.$

Let us first dispose of the contribution of the large $q$, for which $q \ge |B|$,
where it is possible to estimate $q^{\varepsilon/2 - 1}$ by $|B|^{\varepsilon/2 - 1}$. Observe that for any
two distinct fractions $a/q$, $a'/q'$ with $q, q' \le R^2$ and $a \in \mathbb{Z}_q^*$, $a' \in \mathbb{Z}_{q'}^*$ we
have

    $$\Big\| \frac{a}{q} - \frac{a'}{q'} \Big\| \ge \frac{1}{qq'} \ge \frac{1}{R^4} \ge \frac{2}{N}$$

since $N \gg R^{10}$. This shows that for each fixed $b, b'$ there is at most one
fraction $a/q$ which contributes to the sum (4.4). Thus the contribution of the
large $q$ is at most $|B|^2 |B|^{\varepsilon/2 - 1}$, which is acceptable. It will thus suffice to
estimate the sum over small $q$, that is to say those $q$ for which $q \le |B|$, and
to that end it is enough to prove that



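(4.5) $\sum_{q \le |B|} \sum_{a \in \mathbb{Z}_q^*} \sum_{b, b' \in B :\ \| \frac{b - b'}{N} - \frac{a}{q} \| \le 1/N} q^{-1} \ll_{\varepsilon} |B|^{1+\varepsilon/2}.$

We estimate the left-hand side by

    $$c \sum_{q \le |B|} \sum_{a \in \mathbb{Z}_q} \sum_{b, b' \in B} \sum_{m \in \mathbb{Z}} q^{-1} \hat\psi\Big( N \Big( m + \frac{b - b'}{N} - \frac{a}{q} \Big) \Big),$$

where $c = \max_{y \in [-1,1]} |\hat\psi(y)|^{-1}$, which by the Poisson summation formula
is

    $$c N^{-1} \sum_{q \le |B|} \sum_{a \in \mathbb{Z}_q} \sum_{b, b' \in B} \sum_{n \in \mathbb{Z}} q^{-1} \psi(n/N) e_N((b - b')n) e_q(-an).$$

Performing the summation over $a$, this becomes

    $$c N^{-1} \sum_{q \le |B|} \sum_{b, b' \in B} \sum_{n \in \mathbb{Z} :\ q \mid n} \psi(n/N) e_N((b - b')n),$$

which we can rearrange as

(4.6) $c N^{-1} \sum_{n \in \mathbb{Z}} \psi(n/N) \Big| \sum_{b \in B} e_N(bn) \Big|^2 d_{|B|}(n),$

where

    $$d_{|B|}(n) := \sum_{q \le |B| :\ q \mid n} 1.$$

The $n = 0$ term of this sum may be bounded, very crudely, by $O(N^{-1} |B|^3)$.
Since $|B| \le R^{\varepsilon} \ll N^{1/10}$, this term is certainly at most $O(|B|)$ and hence
does not make an important contribution to (4.5). Let us thus discard
the $n = 0$ term, and bound the remaining terms using Hölder's inequality.
Since $\psi(n/N)$ is supported in the region $|n| \le N/2$, we thus obtain

    $$(4.6) \ll \Big( \mathbb{E}_{0 < |n| \le N/2} \Big| \sum_{b \in B} e_N(bn) \Big|^{2+\varepsilon} \Big)^{\frac{2}{2+\varepsilon}} \Big( \mathbb{E}_{0 < |n| \le N/2} \, d_{|B|}(n)^{\frac{2+\varepsilon}{\varepsilon}} \Big)^{\frac{\varepsilon}{2+\varepsilon}}.$$

Parseval and the crude bound $| \sum_{b \in B} e_N(bn) | \le |B|$ yield

    $$\Big( \mathbb{E}_{0 < |n| \le N/2} \Big| \sum_{b \in B} e_N(bn) \Big|^{2+\varepsilon} \Big)^{\frac{2}{2+\varepsilon}} \ll \big( |B|^{1+\varepsilon} \big)^{\frac{2}{2+\varepsilon}} = |B|^{1 + \frac{\varepsilon}{2+\varepsilon}}.$$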
On the other hand, from standard moment estimates for the restricted
divisor function $d_{|B|}$ (see for instance [...]) we have

    $$\mathbb{E}_{0 < |n| \le N/2} \, d_{|B|}(n)^m \ll_{\varepsilon,m} |B|^{\varepsilon}$$

for any $m \ge 1$ (in fact one can replace $|B|^{\varepsilon}$ with $\log^{2^m - 1} |B|$ when $m$ is
an integer; see [30]). Applying this with $m := (2+\varepsilon)/\varepsilon$ and working back, one
gets $(4.6) \ll_{\varepsilon} |B|^{1+\varepsilon/2}$, which in turn implies (4.5) and (4.4), and hence the
proposition. □
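
The passage from the sum over $a \in \mathbb{Z}_q$ to the expression (4.6) rests only on the orthogonality relation $\sum_{a \in \mathbb{Z}_q} e_q(-an) = q \, 1_{q \mid n}$. The following Python sketch checks that rearrangement numerically on a small example. It is a sanity check under our own assumptions, not part of the argument: the weight is an arbitrary finitely supported function standing in for $\psi(n/N)$ (the identity is purely algebraic, so no smoothness is needed), the constant $cN^{-1}$ is dropped from both sides, and all names are hypothetical.

```python
import cmath


def e(x):
    """The character e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * cmath.pi * x)


def restricted_divisor(n, Q):
    """d_Q(n): number of q <= Q dividing n (so d_Q(0) = Q)."""
    return sum(1 for q in range(1, Q + 1) if n % q == 0)


def sum_before_rearrangement(B, N, weights):
    """Sum over q <= |B|, a in Z_q, b, b' in B and n of
    q^{-1} * weight(n) * e_N((b - b') n) * e_q(-a n)."""
    Q = len(B)
    total = 0
    for q in range(1, Q + 1):
        for a in range(q):
            for b in B:
                for b2 in B:
                    for n, w in weights.items():
                        total += w * e((b - b2) * n / N) * e(-a * n / q) / q
    return total


def sum_after_rearrangement(B, N, weights):
    """Sum over n of weight(n) * |sum_{b in B} e_N(b n)|^2 * d_{|B|}(n)."""
    Q = len(B)
    total = 0.0
    for n, w in weights.items():
        inner = sum(e(b * n / N) for b in B)
        total += w * abs(inner) ** 2 * restricted_divisor(n, Q)
    return total


if __name__ == "__main__":
    N = 101
    B = [1, 3, 4, 9]
    # A tent-shaped stand-in for psi(n/N), supported on |n| <= 20.
    weights = {n: max(0.0, 1.0 - abs(n) / 20.0) for n in range(-20, 21)}
    print(sum_before_rearrangement(B, N, weights))
    print(sum_after_rearrangement(B, N, weights))
```

The two printed values agree up to floating-point error; the first is returned as a complex number with negligible imaginary part.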

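As promised, we now give what is essentially a dual form of Proposition 4.1.

Proposition 4.2 ($L^p$ extension estimate for $\beta$). Let $R, N$ be large numbers
such that $1 \ll R \ll N^{1/10}$ and let $k$, $F$, $\beta_R$ be as in Proposition 3.1. Let
$\{a_n\}_{n \le N}$ be an arbitrary sequence of complex numbers, and let $p > 2$ be a
fixed real number. Then we have

(4.7) $\Big( \sum_{b \in \mathbb{Z}_N} \big| \mathbb{E}_{1 \le n \le N} \, a_n \beta_R(n) e_N(-bn) \big|^p \Big)^{1/p} \ll_{p,k} \Big( \mathbb{E}_{1 \le n \le N} |a_n|^2 \beta_R(n) \Big)^{1/2}$

and

(4.8) $\big\| \mathbb{E}_{1 \le n \le N} \, a_n \beta_R(n) e(n\theta) \big\|_{L^p(\mathbb{T})} \ll_{p,k} N^{-1/p} \Big( \mathbb{E}_{1 \le n \le N} |a_n|^2 \beta_R(n) \Big)^{1/2}.$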

Proof. The first claim follows from Proposition 4.1 and a duality argument.
We will expand on this point somewhat using the language of operators,
since this will afford us an opportunity to at least partially explain the
terms restriction and extension. Everything we say could also be phrased
in terms of inequalities on exponential sums. Recall that $L^2_{\beta,N}$ is the space
of complex sequences $(a_n)_{n=1}^{N}$ together with the inner product

    $$\langle (a_n), (b_n) \rangle := \mathbb{E}_{1 \le n \le N} \, a_n \overline{b_n} \, \beta_R(n).$$

Consider also the space of complex-valued functions on $\mathbb{Z}_N$ together with
the $l^q$ norms

    $$\|f\|_{l^q(\mathbb{Z}_N)} := \Big( \sum_{x \in \mathbb{Z}_N} |f(x)|^q \Big)^{1/q}$$

and the inner product $\langle f, g \rangle := \sum_{x \in \mathbb{Z}_N} f(x) \overline{g(x)}$. Here, $\{1, \dots, N\}$ may
perhaps be thought of as a finite version of $\mathbb{Z}$, whilst $\mathbb{Z}_N$ plays the role of a
discretised circle $\mathbb{T}$. Now define the restriction map $T : l^q(\mathbb{Z}_N) \to L^2_{\beta,N}$ by

    $$f \mapsto \Big( \sum_{x \in \mathbb{Z}_N} f(x) e_N(nx) \Big)_{n=1}^{N}$$

(the reason for the name is that $T$ can be thought of as the restriction of
the discrete Fourier transform on $\mathbb{Z}_N$ to the weight $\beta$). One can check that
the adjoint of $T$ is the extension map $T^* : L^2_{\beta,N} \to l^p(\mathbb{Z}_N)$ defined by

    $$(a_n)_{n=1}^{N} \mapsto \mathbb{E}_{1 \le n \le N} \, a_n \beta_R(n) e_N(-nx).$$

Here, of course, $p$ is the dual exponent to $q$ defined by the relation $p^{-1} +
q^{-1} = 1$; note, however, that all of the spaces $l^r(\mathbb{Z}_N)$ are equivalent as
vector spaces.

Now one can check that (4.1), the main result of Proposition 4.1, is
equivalent to

    $$\|T\|_{l^q(\mathbb{Z}_N) \to L^2_{\beta,N}} \ll_{p,k} 1.$$

On the other hand (4.7) is equivalent to the statement

    $$\|T^*\|_{L^2_{\beta,N} \to l^p(\mathbb{Z}_N)} \ll_{p,k} 1.$$

By the duality principle for operator norms one sees that these two
statements are completely equivalent, and so (4.7) is an immediate deduction
from Proposition 4.1.

To prove (4.8), observe that for any $0 \le \theta < 1$ we can obtain the estimate

    $$\sum_{b \in \mathbb{Z}_N} \big| \mathbb{E}_{1 \le n \le N} \, a_n \beta_R(n) e((b + \theta)n/N) \big|^p \ll_{p,k} \Big( \mathbb{E}_{1 \le n \le N} |a_n|^2 \beta_R(n) \Big)^{p/2}$$

from (4.7) simply by multiplying each coefficient $a_n$ by $e_N(\theta n)$. Integrating
this in $\theta$ from 0 to 1 gives (4.8). □

Remark. A result of Marcinkiewicz [38] allows one to obtain (4.7) from
(4.8); see [...] for further discussion.
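
The claim that the extension map is the adjoint of the restriction map amounts to the identity $\langle Tf, (a_n) \rangle_{L^2_{\beta,N}} = \langle f, T^*(a_n) \rangle$. The following Python sketch verifies it numerically for random data. It is only a sanity check under our own assumptions: the weight `beta` is an arbitrary positive sequence rather than an actual enveloping sieve, and the function names are hypothetical.

```python
import cmath
import random


def e_N(x, N):
    """The additive character e_N(x) = exp(2*pi*i*x/N)."""
    return cmath.exp(2j * cmath.pi * x / N)


def restriction(f, N):
    """T f = (sum_{x in Z_N} f(x) e_N(n x))_{n = 1..N}."""
    return [sum(f[x] * e_N(n * x, N) for x in range(N)) for n in range(1, N + 1)]


def extension(a, beta, N):
    """T* a (x) = E_{1 <= n <= N} a_n beta(n) e_N(-n x), for x in Z_N."""
    return [
        sum(a[n - 1] * beta[n - 1] * e_N(-n * x, N) for n in range(1, N + 1)) / N
        for x in range(N)
    ]


def inner_weighted(u, v, beta):
    """Inner product on L^2_{beta,N}: E_n u_n conj(v_n) beta(n)."""
    N = len(u)
    return sum(u[i] * v[i].conjugate() * beta[i] for i in range(N)) / N


def inner_plain(f, g):
    """Inner product on functions on Z_N: sum_x f(x) conj(g(x))."""
    return sum(x * y.conjugate() for x, y in zip(f, g))


if __name__ == "__main__":
    rng = random.Random(0)
    N = 11
    f = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(N)]
    a = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(N)]
    beta = [rng.uniform(0.1, 3.0) for _ in range(N)]  # stand-in weight
    print(inner_weighted(restriction(f, N), a, beta))
    print(inner_plain(f, extension(a, beta, N)))
```

Both printed values coincide up to rounding, for any choice of non-negative weight.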

The utility of Proposition 4.2 is markedly increased by the next lemma.

Lemma 4.1. Suppose that $R \le \sqrt{N}$. Then

    $$\mathbb{E}_{1 \le n \le N} \, \beta_R(n) \ll 1.$$

Proof. We make further use of the smooth cutoff function $\psi$. We have

    $$\mathbb{E}_{1 \le n \le N} \, \beta_R(n) \ll \frac{1}{N} \sum_{n \in \mathbb{Z}} \hat\psi(n/N) \beta_R(n).$$

Substituting in the Fourier expansion (3.3) of $\beta_R$ and using the Poisson
summation formula, we have

    $$N^{-1} \sum_{n \in \mathbb{Z}} \hat\psi(n/N) \beta_R(n) = \sum_{q \le R^2} \sum_{a \in \mathbb{Z}_q^*} w(a/q) \sum_{m \in \mathbb{Z}} \psi\Big( N \Big( m - \frac{a}{q} \Big) \Big).$$

Observe that since $R^2 \le N$, the only time that $\psi(N(m - \frac{a}{q}))$ is non-zero is
when $m = a = 0$ and $q = 1$. Thus, by (3.4), we obtain

    $$\frac{1}{N} \sum_{n \in \mathbb{Z}} \hat\psi(n/N) \beta_R(n) = w(0) \psi(0) \ll 1,$$

and the claim follows. □

Observe that the last lemma, combined with Proposition 3.1(i), implies the
classical sieve estimate (1.8). With these preliminaries in place we can now
quickly conclude the proof of Theorem 1.1.

Proof of Theorem 1.1. Fix $R := \lfloor N^{1/20} \rfloor$ (for instance). Without loss of
generality we may assume $N$ to be sufficiently large depending on $k$ and
$p$. We now apply Proposition 4.2 to the sequence $a_n := 1_{X \cap X_{R!}}(n)/\beta(n)$.
From (3.1), (4.8) and Lemma 4.1 we easily obtain

    $$\Big\| \sum_{n \in X \cap X_{R!} \cap [1,N]} e(n\theta) \Big\|_{L^p(\mathbb{T})} \ll_{p,k} \mathfrak{S}_F \, N^{1-1/p} \log^{-k} R.$$

Since all but at most $kR$ elements of $X$ lie in $X_{R!}$, the claim now follows
since $R \sim N^{1/20}$ and $N$ is assumed sufficiently large. □

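Remark. A straightforward modification of the above argument also gives
the estimate

    $$\Big( \int_{\mathbb{T}} \Big| \sum_{n \le N} \prod_{j=1}^{k} \Lambda(a_j n + b_j) \, e(n\theta) \Big|^p d\theta \Big)^{1/p} \ll_{p,k} \mathfrak{S}_F \, N^{1-1/p},$$

where $\Lambda$ is the von Mangoldt function.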

5. A transference principle for 3-term arithmetic progressions 

There is a well-known theorem of Roth [29], partially answering a question
of Erdős and Turán, which states that any subset of $\{1, \dots, N\}$ with size
at least $cN/\log\log N$ contains an arithmetic progression of length 3. In
this section we will prove a result in this spirit for sets which are majorised
by a "pseudorandom measure". In [...] we will apply this result to deduce
that there are infinitely many 3-term progressions of Chen primes (primes
$p$ such that $p + 2$ is prime or a product of two primes).

We refer to the result (Proposition 5.1) as a transference principle because,
rather than prove it from first principles, we deduce it from Roth's theorem.
For the reader familiar with [18], we remark that the transference principle
we prove here is shown to hold under harmonic analysis conditions which
should be regarded as much less stringent than the combinatorial conditions
of that paper. However, our arguments here only apply to progressions of
length 3. Indeed, it is by now well-understood that traditional harmonic
analysis arguments cannot suffice to deal with progressions of length 4 or
more (see [...] for a further discussion).

We begin by recalling a formulation of Roth's theorem [29] due to
Varnavides [35].

Theorem 5.1. [29, 35] Let $0 < \delta \le 1$ be a positive number, and let $N$ be a
large prime parameter. Suppose that $f : \mathbb{Z}_N \to \mathbb{R}^+$ is a function satisfying
the uniform bounds

    $0 \le f(n) \le 1$ for all $n \in \mathbb{Z}_N$

and the mean property

    $\mathbb{E}(f) \ge \delta.$

Then we have

(5.1) $\mathbb{E}_{n,d \in \mathbb{Z}_N} f(n) f(n + d) f(n + 2d) \ge c(\delta)$

for some constant $c(\delta) > 0$ depending only on $\delta$.

Remark. By combining Bourgain's refinement [...] of Roth's theorem with
the argument of Varnavides [35] one can obtain the more explicit value
$c(\delta) = \exp(-C \delta^{-2} \log(1 + \frac{1}{\delta}))$ for $c(\delta)$; see also [...] for further discussion.
However, our arguments will not be sensitive to the exact value of $c(\delta)$.
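
The trilinear average on the left-hand side of (5.1) is easy to evaluate directly for small $N$. The following Python sketch does so by brute force; the function name and the sample inputs are our own illustrative choices. Note that the average includes the degenerate progressions with $d = 0$, so for the indicator of a set $A$ it is always at least $|A|/N^2$.

```python
def three_ap_average(f):
    """E_{n, d in Z_N} f(n) f(n + d) f(n + 2d), with N = len(f)."""
    N = len(f)
    total = 0.0
    for n in range(N):
        for d in range(N):
            total += f[n] * f[(n + d) % N] * f[(n + 2 * d) % N]
    return total / (N * N)


if __name__ == "__main__":
    N = 7
    constant = [0.5] * N                 # f identically 1/2
    point = [1.0] + [0.0] * (N - 1)      # indicator of the single point {0}
    print(three_ap_average(constant))    # equals (1/2)^3
    print(three_ap_average(point))       # only d = 0 contributes: 1/N^2
```

For a constant function $f \equiv \delta$ the average is exactly $\delta^3$, which is the heuristic size one expects for a "random" function of mean $\delta$.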

We now generalise the above proposition to the case where $f$ is not bounded
by 1, but is instead bounded by some suitably pseudorandom function.

Definition. We shall normalize the Fourier transform $\hat f : \mathbb{Z}_N \to \mathbb{C}$ of a
function $f : \mathbb{Z}_N \to \mathbb{C}$ according to the formula

    $\hat f(a) := \mathbb{E}_{n \in \mathbb{Z}_N} f(n) e_N(an)$ for all $a \in \mathbb{Z}_N$,

and recall the Fourier inversion formula

(5.2) $f(n) = \sum_{a \in \mathbb{Z}_N} \hat f(a) e_N(-an).$

With this normalisation it is natural to measure the Fourier transform $\hat f$
using the $l^p$ norms

    $$\|\hat f\|_p^p := \sum_{a \in \mathbb{Z}_N} |\hat f(a)|^p$$

for $1 < p < \infty$, with the usual convention that $\|\hat f\|_\infty := \sup_{a \in \mathbb{Z}_N} |\hat f(a)|$.
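
The normalisation above (an average on the physical side, a sum on the frequency side) can be checked with a few lines of Python. This is a minimal sketch with hypothetical function names; it verifies the inversion formula (5.2), Parseval's identity in the form $\sum_a |\hat f(a)|^2 = \mathbb{E}_n |f(n)|^2$, and the fact that the constant function 1 has Fourier transform equal to the Kronecker delta at 0.

```python
import cmath


def fourier(f):
    """Normalised transform: fhat(a) = E_{n in Z_N} f(n) e_N(a n)."""
    N = len(f)
    return [
        sum(f[n] * cmath.exp(2j * cmath.pi * a * n / N) for n in range(N)) / N
        for a in range(N)
    ]


def inverse(fhat):
    """Inversion formula (5.2): f(n) = sum_{a in Z_N} fhat(a) e_N(-a n)."""
    N = len(fhat)
    return [
        sum(fhat[a] * cmath.exp(-2j * cmath.pi * a * n / N) for a in range(N))
        for n in range(N)
    ]


def lp_norm(fhat, p):
    """The l^p norm (sum_a |fhat(a)|^p)^(1/p) of a Fourier transform."""
    return sum(abs(c) ** p for c in fhat) ** (1.0 / p)


if __name__ == "__main__":
    f = [1, 0, 2, 5, 0, 3, 1]  # arbitrary sample function on Z_7
    fhat = fourier(f)
    print([round(abs(x), 6) for x in inverse(fhat)])
    print(lp_norm(fhat, 2) ** 2, sum(x * x for x in f) / len(f))
```

With this convention $\hat f(0) = \mathbb{E}(f)$, which is why the mean condition and the pseudorandomness condition in the next proposition are both naturally stated in terms of Fourier coefficients.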

Proposition 5.1. Let $N$ be a large prime parameter. Suppose that $\nu :
\mathbb{Z}_N \to \mathbb{R}^+$ and $f : \mathbb{Z}_N \to \mathbb{R}^+$ are non-negative functions satisfying the
majorization condition

(5.3) $0 \le f(n) \le \nu(n)$ for all $n \in \mathbb{Z}_N$

and the mean condition

(5.4) $\mathbb{E}_{n \in \mathbb{Z}_N} f(n) \ge \delta$

for some $0 < \delta \le 1$. Suppose that $\nu$ and $f$ also obey the pseudorandomness
conditions

(5.5) $|\hat\nu(a) - \delta_{a,0}| \le \eta$ for all $a \in \mathbb{Z}_N$,

where $\delta_{a,0}$ is the Kronecker delta, and

(5.6) $\|\hat f\|_q \le M$

for some $\eta, M > 0$ and some $2 < q < 3$. Then we have

(5.7) $\mathbb{E}_{n,d \in \mathbb{Z}_N} f(n) f(n + d) f(n + 2d) \ge \frac{1}{2} c(\delta) - O_{\delta,M,q}(\eta),$

where $c(\delta) > 0$ is the same constant that appears in Theorem 5.1.

Remark. Observe that the $\nu \equiv 1$, $\eta = 0$ case of this proposition is
basically Theorem 5.1 with an (inconsequential) loss of $\frac{1}{2}$ on the right-hand
side of (5.1). This theorem should be compared with [18, Theorem 3.5]. In
practice, the condition (5.6) seems to be most easily verified by first
establishing a Hardy-Littlewood majorant property such as Proposition 4.2 for
all functions $f$ majorized by $\nu$, not just the specific function $f$ of interest.

Proof. We repeat the arguments of ^3 §6]. Let $0 < \varepsilon \ll 1$ be a small parameter to be chosen later, and let

$$\Omega := \{a \in \mathbb{Z}_N : |\hat f(a)| \ge \varepsilon\} \tag{5.8}$$

be the set of large Fourier coefficients of $f$. Write $B := B(\Omega, \varepsilon)$ for the Bohr set

$$B := B(\Omega,\varepsilon) := \{m \in \mathbb{Z}_N : |1 - e_N(am)| \le \varepsilon \text{ for all } a \in \Omega\}. \tag{5.9}$$

We shall decompose $f = f_1 + f_2$, where $f_1$ is the anti-uniform component of $f$, defined by

$$f_1(n) := \mathbb{E}_{m_1,m_2 \in B} f(n + m_1 - m_2), \tag{5.10}$$

and $f_2 := f - f_1$ will be the uniform component. We can thus split the left-hand side of (5.7) into eight components

$$\mathbb{E}_{n,d \in \mathbb{Z}_N} f_i(n) f_j(n+d) f_k(n+2d) \tag{5.11}$$

where $i,j,k \in \{1,2\}$.
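The decomposition can be illustrated on a toy example. The sketch below is only an illustration under assumed data: the modulus $N = 11$, the function `f` and the threshold `eps = 0.6` are hypothetical choices, far from the asymptotic regime of the proposition. It builds $\Omega$, the Bohr set $B$ and the components $f_1, f_2$ as in (5.8)–(5.10), and confirms that the eight terms (5.11) add up to the full trilinear average.

```python
import cmath
from itertools import product

def e_N(x, N):
    return cmath.exp(2j * cmath.pi * x / N)

def fourier(f, N):
    # f_hat(a) = E_n f(n) e_N(a n)
    return [sum(f[n] * e_N(a * n, N) for n in range(N)) / N for a in range(N)]

def bohr_set(omega, eps, N):
    # B(Omega, eps) as in (5.9)
    return [m for m in range(N)
            if all(abs(1 - e_N(a * m, N)) <= eps for a in omega)]

def smooth(f, B, N):
    # f_1(n) = E_{m1, m2 in B} f(n + m1 - m2), as in (5.10)
    return [sum(f[(n + m1 - m2) % N] for m1 in B for m2 in B) / len(B) ** 2
            for n in range(N)]

def trilinear(g1, g2, g3, N):
    # E_{n, d in Z_N} g1(n) g2(n + d) g3(n + 2d)
    return sum(g1[n] * g2[(n + d) % N] * g3[(n + 2 * d) % N]
               for n in range(N) for d in range(N)) / N ** 2

N = 11                                   # hypothetical small prime
f = [0, 2, 0, 0, 3, 1, 0, 0, 2, 0, 1]    # hypothetical non-negative function
eps = 0.6                                # hypothetical threshold
f_hat = fourier(f, N)
omega = [a for a in range(N) if abs(f_hat[a]) >= eps]   # (5.8)
B = bohr_set(omega, eps, N)
f1 = smooth(f, B, N)
f2 = [f[n] - f1[n] for n in range(N)]
parts = {1: f1, 2: f2}
total = trilinear(f, f, f, N)
eight = sum(trilinear(parts[i], parts[j], parts[k], N)
            for i, j, k in product((1, 2), repeat=3))
```

Note that $0 \in B$ always, so the averages defining $f_1$ are well defined, and that $f_1$ inherits non-negativity and the mean of $f$.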

Let us first consider the term (5.11) corresponding to $(i,j,k) = (1,1,1)$, which will be the dominant term. The pseudorandomness condition (5.5) can be used to obtain uniform bounds on $f_1$. Indeed using that together with (5.2), (5.3) and Parseval's identity we obtain

$$\begin{aligned}
|f_1(n)| &= |\mathbb{E}_{m_1,m_2 \in B} f(n + m_1 - m_2)| \\
&\le \mathbb{E}_{m_1,m_2 \in B} \nu(n + m_1 - m_2) \\
&= \Big|\mathbb{E}_{m_1,m_2 \in B} \sum_{a \in \mathbb{Z}_N} \hat\nu(a) e_N(-an - am_1 + am_2)\Big| \\
&= \Big|\sum_{a \in \mathbb{Z}_N} \hat\nu(a) e_N(-an) |\mathbb{E}_{m \in B} e_N(-am)|^2\Big| \\
&\le \sum_{a \in \mathbb{Z}_N} |\hat\nu(a)| |\mathbb{E}_{m \in B} e_N(-am)|^2 \\
&\le 1 + \eta \sum_{a \in \mathbb{Z}_N} |\mathbb{E}_{m \in B} e_N(-am)|^2 \\
&= 1 + \eta N/|B|.
\end{aligned}$$

Note, however, that by the pigeonhole principle we have

$$|B| \ge (c\varepsilon)^{|\Omega|} N$$

for some absolute constant $c > 0$, while from (5.6) and Chebyshev's inequality we have

$$|\Omega| \le (M/\varepsilon)^q.$$

Finally, $f_1$ is manifestly non-negative by (5.10) and (5.3). We thus conclude that

$$0 \le f_1(n) \le 1 + O_{\varepsilon,M,q}(\eta) \quad \text{for all } n \in \mathbb{Z}_N.$$

Furthermore from (5.4) we have

$$\mathbb{E}(f_1) = \mathbb{E}(f) \ge \delta.$$

Applying Theorem 5.1 (possibly modifying $f_1$ by $O_{\varepsilon,M,q}(\eta)$ first) we obtain

$$\mathbb{E}_{x,d \in \mathbb{Z}_N} f_1(x) f_1(x+d) f_1(x+2d) \ge c(\delta) - O_{\varepsilon,\delta,M,q}(\eta). \tag{5.12}$$

Now we consider the other seven terms of the form (5.11), when at least one of the $i, j, k$ is equal to 2. The treatment of the seven cases is essentially the same, so for simplicity we consider only $i = j = k = 2$. Observe that for all $a \in \mathbb{Z}_N$ we have

$$\begin{aligned}
\hat f_1(a) &= \mathbb{E}_{n \in \mathbb{Z}_N} \mathbb{E}_{m_1,m_2 \in B} f(n + m_1 - m_2) e_N(an) \\
&= \mathbb{E}_{n \in \mathbb{Z}_N} \mathbb{E}_{m_1,m_2 \in B} f(n + m_1 - m_2) e_N(a(n + m_1 - m_2)) \, e_N(-am_1) e_N(am_2) \\
&= \hat f(a) |\mathbb{E}_{m \in B} e_N(-am)|^2,
\end{aligned} \tag{5.13}$$






and thus

$$|\hat f_2(a)| = |\hat f(a)| (1 - |\mathbb{E}_{m \in B} e_N(-am)|^2) \le |\hat f(a)|. \tag{5.14}$$

This can be analysed in two different ways according as $a$ does or does not lie in $\Omega$. When $a \notin \Omega$ we have

$$|\hat f_2(a)| \le |\hat f(a)| < \varepsilon$$

by (5.8). On the other hand, when $a \in \Omega$, then $|e_N(-am) - 1| \le \varepsilon$ for all $m \in B$ by definition (5.9), and hence in this case we have

$$|\hat f_2(a)| = |\hat f(a)| (1 - |\mathbb{E}_{m \in B} e_N(-am)|^2) \le 3\varepsilon \mathbb{E}(\nu) \le 3\varepsilon(1 + \eta)$$

thanks to the $a = 0$ case of (5.5). Thus certainly

$$\|\hat f_2\|_\infty = O_\eta(\varepsilon). \tag{5.15}$$

Now by orthogonality we obtain the well-known identity

$$\mathbb{E}_{n,d \in \mathbb{Z}_N} f_2(n) f_2(n+d) f_2(n+2d) = \sum_{a \in \mathbb{Z}_N} \hat f_2(a)^2 \hat f_2(-2a).$$
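The orthogonality identity holds for any function on $\mathbb{Z}_N$ with the normalisation of this section, and is easy to confirm numerically. The sketch below is illustrative only; the modulus and the test function `g` are arbitrary choices.

```python
import cmath

def e_N(x, N):
    return cmath.exp(2j * cmath.pi * x / N)

def fourier(f, N):
    # f_hat(a) = E_n f(n) e_N(a n)
    return [sum(f[n] * e_N(a * n, N) for n in range(N)) / N for a in range(N)]

def three_term_average(f, N):
    # E_{n, d in Z_N} f(n) f(n + d) f(n + 2d)
    return sum(f[n] * f[(n + d) % N] * f[(n + 2 * d) % N]
               for n in range(N) for d in range(N)) / N ** 2

N = 13                                               # arbitrary odd prime
g = [1, -2, 0, 3, 1, 1, -1, 0, 2, 0, 0, 1, -3]       # arbitrary real function
g_hat = fourier(g, N)
lhs = three_term_average(g, N)
rhs = sum(g_hat[a] ** 2 * g_hat[(-2 * a) % N] for a in range(N))
```

Expanding each factor by (5.2), only the frequency triples $(a, -2a, a)$ survive the averaging in $n$ and $d$, which is why a single sum over $a$ remains.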

Such an expression is often treated using an $L^2$-$L^\infty$ inequality, but for us this is too expensive since we lack good $L^2$ control on $\nu$ (and hence on $f_2$). Instead, we shall use the $L^q$ hypothesis (5.6). By Hölder's inequality we obtain

$$|\mathbb{E}_{n,d \in \mathbb{Z}_N} f_2(n) f_2(n+d) f_2(n+2d)| \le \|\hat f_2\|_q^q \|\hat f_2\|_\infty^{3-q}. \tag{5.16}$$

From (5.6) and (5.14) we have

$$\|\hat f_2\|_q \le \|\hat f\|_q \le M.$$

Recalling (5.15) we therefore obtain, by (5.16), the bound

$$\mathbb{E}_{n,d \in \mathbb{Z}_N} f_2(n) f_2(n+d) f_2(n+2d) = O(M^q \varepsilon^{3-q}).$$
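For the reader's convenience, here is one way the Hölder step (5.16) can be carried out; this is a sketch of a standard argument and the particular splitting of exponents is our choice, not necessarily the one the authors had in mind. Since $2 < q < 3$, the exponents $2 - 2q/3$ and $1 - q/3$ are non-negative, so

```latex
\begin{aligned}
\Big|\sum_{a} \hat f_2(a)^2 \hat f_2(-2a)\Big|
 &\le \|\hat f_2\|_\infty^{3-q} \sum_{a} |\hat f_2(a)|^{2q/3}\, |\hat f_2(-2a)|^{q/3} \\
 &\le \|\hat f_2\|_\infty^{3-q}
      \Big(\sum_{a} |\hat f_2(a)|^{q}\Big)^{2/3}
      \Big(\sum_{a} |\hat f_2(-2a)|^{q}\Big)^{1/3} \\
 &= \|\hat f_2\|_\infty^{3-q}\, \|\hat f_2\|_q^{q}.
\end{aligned}
```

The second line is Hölder's inequality with exponents $3/2$ and $3$, and the last line uses that $a \mapsto -2a$ is a bijection of $\mathbb{Z}_N$ because $N$ is an odd prime.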

By choosing $\varepsilon$ sufficiently small depending on $\delta, M, q$, this can be made less than $\tfrac{1}{16} c(\delta)$. The same goes for the six other terms of the form (5.11) with $(i,j,k) \ne (1,1,1)$. Combining this with (5.12), the claim follows. □

6. Arithmetic progressions of Chen primes 

In this section we prove Theorem 1.2, which stated that there are infinitely many 3-term progressions of Chen primes. A Chen prime, recall, is a prime $p$ such that $p + 2$ is a product of at most two primes, and Chen's famous theorem [5] is that there are infinitely many such primes. Chen actually proved a somewhat stronger result, in which the smallest prime factor of $p + 2$ can be bounded below by $p^{1/10}$. It is important for us to have this extra information (though any positive number would do in place of 1/10). In Iwaniec's unpublished notes [22] one may find a proof of Chen's theorem which leads to the value $\gamma = 3/11$ for this exponent. For the purposes of this section we will use the term Chen prime to refer to a $p$ for which $p + 2$ is either prime or a product $p_1 p_2$, with $p_1, p_2 \ge p^{3/11}$. We quote, then, the following result from [22].$^3$

Theorem 6.1. Let $N$ be a large integer. Then the number of Chen primes in the interval $(N/2, N]$ is at least $c_1 N/\log^2 N$, for some absolute constant $c_1 > 0$.

For our argument we do not need to know an exact value for $c_1$. To have a chance of deducing Theorem 1.2 from Proposition 5.1 we shall need a suitably pseudorandom measure $\nu$ on the cyclic group $\mathbb{Z}_N$ which majorises the Chen primes on $[N/4, N/2]$ in a reasonably efficient way. Unfortunately, the Chen primes are so irregularly distributed in progressions $a \pmod q$ ($q$ small) that there is no $\nu$ which would obey the pseudorandomness condition (5.5) for an $\eta$ which is small enough to be useful. To get around this difficulty, we shall employ a device which we call the $W$-trick. Let $t \gg 1$ be a very large real number (independent of $N$) to be specified later, and write $W = W_t := \prod_{p \le t} p$. Let $X_W \subseteq \mathbb{Z}_W$ denote the residue classes $b \in \mathbb{Z}_W$ such that $b$ and $b + 2$ are both coprime to $W$; observe that

$$|X_W| = \frac{W}{2} \prod_{3 \le p \le t} \Big(1 - \frac{2}{p}\Big) \ll W/\log^2 t. \tag{6.1}$$
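The set $X_W$ is easy to enumerate for small $t$. The following sketch (with the illustrative choice $t = 7$, so $W = 210$) counts the admissible classes by brute force and compares with the local count $\prod_{3 \le p \le t}(p - 2)$: modulo 2 only odd $b$ survive, and modulo each odd prime $p \le t$ exactly the two classes $b \equiv 0, -2$ are excluded.

```python
from math import gcd, prod

def primes_up_to(t):
    # Trial division; adequate for the tiny values of t used here.
    return [p for p in range(2, t + 1)
            if all(p % r for r in range(2, int(p ** 0.5) + 1))]

def admissible_classes(t):
    """Return W = prod_{p <= t} p and the classes b mod W with (b, W) = (b + 2, W) = 1."""
    W = prod(primes_up_to(t))
    X_W = [b for b in range(W) if gcd(b, W) == 1 and gcd(b + 2, W) == 1]
    return W, X_W

t = 7                                   # illustrative choice; the paper takes t large
W, X_W = admissible_classes(t)
predicted = prod(p - 2 for p in primes_up_to(t) if p >= 3)
```

For $t = 7$ this gives 15 admissible classes out of 210, already a noticeably thinner set than the full residue ring.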

Note that all but $O(W)$ of the Chen primes lie in one of the residue classes associated to $X_W$. From Theorem 6.1 we thus have

$$\sum_{b \in X_W} |\{N/4 \le n \le N/2 : Wn + b \text{ is a Chen prime}\}| \gg \frac{WN}{\log^2(WN)} - O(W).$$

If we assume $N$ sufficiently large depending on $W$ (and hence on $t$), the right-hand side is $\gg WN/\log^2 N$. From (6.1) and the pigeonhole principle we may therefore choose $b \in X_W$ such that

$$|X| \gg \frac{N \log^2 t}{\log^2 N} \tag{6.2}$$

where $X$ is the set

$$X := \{N/4 \le n \le N/2 : Wn + b \text{ is a Chen prime}\}.$$

Fix $b \in X_W$ as above, and consider the polynomial $F(n) := (Wn + b)(Wn + b + 2)$. To compute $\mathfrak{S}_F$, we observe that $\gamma(p) = 1$ when $p \le t$ and $\gamma(p) = 1 - 2/p$ otherwise. Since the Euler product $\prod_p (1 - 2/p)/(1 - 1/p)^2$ is convergent, we see by the same computation used in (6.1) that

$$\log^2 t \ll \mathfrak{S}_F \ll \log^2 t. \tag{6.3}$$

$^3$ Theorem 6.1 is stronger than the result stated in [22] in one tiny way: Iwaniec shows that there are many Chen primes in $[0, N]$, not $(N/2, N]$. However it is completely clear that the proof can be adapted in a simple way to cover this case. Alternatively, a pigeonhole argument could be used to show that $(N/2, N]$ contains lots of Chen primes for many $N$, which would suffice for our purposes.

Now let $R := \lfloor N^{1/20} \rfloor$ and let $\beta_R : \mathbb{Z} \to \mathbb{R}^+$ be the enveloping sieve associated to $F$ by Proposition 3.1. We restrict this function to the set $\{1, \ldots, N\}$, which we identify with $\mathbb{Z}_N$, and let $\nu : \mathbb{Z}_N \to \mathbb{R}^+$ be the resulting function.

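Lemma 6.1. For all $a \in \mathbb{Z}_N$ and $\varepsilon > 0$, we have

$$\hat\nu(a) = \delta_{a,0} + O_\varepsilon(t^{\varepsilon - 1})$$

where $\delta_{a,0}$ is the Kronecker delta function.

Proof. From (3.3) we have

$$\hat\nu(a) = \mathbb{E}_{n \in \mathbb{Z}_N} \beta_R(n) e_N(an) = \sum_{q \le R^2} \sum_{\substack{b \in \mathbb{Z}_q \\ (b,q) = 1}} w(b/q) \, \mathbb{E}_{n \in \mathbb{Z}_N} e_N(an) e_q(-bn).$$

When $1 < q \le t$ we have $\gamma(q) = 1$, and hence $w(b/q) = 0$ by Proposition 3.1 (iv). By (3.4) we thus have

$$\hat\nu(a) = \mathbb{E}_{n \in \mathbb{Z}_N} e_N(an) + O_\varepsilon\Big(t^{\varepsilon - 1} \sum_{t \le q \le R^2} \sum_{b \in \mathbb{Z}_q^*} |\mathbb{E}_{n \in \mathbb{Z}_N} e_N(an) e_q(-bn)|\Big). \tag{6.4}$$

Observe that $\mathbb{E}_{n \in \mathbb{Z}_N} e_N(an)$ is just $\delta_{a,0}$. From standard estimates on the Dirichlet kernel we have

$$|\mathbb{E}_{n \in \mathbb{Z}_N} e_N(an) e_q(-bn)| \ll \min\Big(1, \frac{1}{N \|\frac{a}{N} - \frac{b}{q}\|}\Big). \tag{6.5}$$

All the fractions $b/q$, $q \le R^2$, are separated from each other by at least $1/R^4 \ge 1/N$. Thus we have $1/(N\|\frac{a}{N} - \frac{b}{q}\|) \ll R^4/N$ with at most one exception, which leads in view of (6.4) and (6.5) to

$$\hat\nu(a) = \delta_{a,0} + O_\varepsilon(t^{\varepsilon - 1}(1 + R^8/N)).$$

The claim follows. □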

Proof of Theorem 1.2. We fix $t$ to be an absolute constant to be chosen later, and take $N$ to be a sufficiently large prime depending on $t$. We define the function $f : \mathbb{Z}_N \to \mathbb{R}^+$ by

$$f(n) := c \, \frac{\log^2 N}{\log^2 t} \, 1_X(n),$$

and verify the various conditions of Proposition 5.1 for $f$.

First of all observe from Proposition 3.1 (i) and (6.3) that we have the majorization property (5.3) if $c > 0$ is chosen sufficiently small. Secondly, from (6.2) we have the condition (5.4) with some absolute constant $\delta > 0$ independent of $N$ or $t$, as long as $N$ is large enough. From Proposition 4.2 (specifically (4.7)) and Lemma 4.1 we obtain the bound $\|\hat f\|_{5/2} \le M$ for some $M > 0$ independent of $N$ or $t$, if $N$ is sufficiently large depending on $t$. This confirms condition (5.6) (with $q = 5/2$).

Finally, from Lemma 6.1 we obtain (5.5) with $\eta = O_\varepsilon(t^{\varepsilon - 1})$.

Thus we may indeed apply Proposition 5.1, and doing so we obtain

$$\mathbb{E}_{x,d \in \mathbb{Z}_N} f(x) f(x+d) f(x+2d) \ge c - O_\varepsilon(t^{\varepsilon - 1})$$

for some absolute constant $c > 0$. By choosing $\varepsilon := 1/2$ (say) and $t$ sufficiently large, we thus obtain

$$\mathbb{E}_{x,d \in \mathbb{Z}_N} f(x) f(x+d) f(x+2d) \ge c/2.$$

From the definition of $f$, this yields

$$|\{(x,d) \in \mathbb{Z}_N^2 : x, x+d, x+2d \in X\}| \gg_t N^2/\log^6 N.$$

The degenerate case $d = 0$ yields at most $O(N)$ pairs and can be discarded when $N$ is large. Since $X$ is contained in $[N/4, N/2]$ we see that $x, x+d, x+2d$ is an arithmetic progression in $\mathbb{Z}$ and not just in $\mathbb{Z}_N$. Theorem 1.2 follows. □
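The last step, passing from progressions in $\mathbb{Z}_N$ to progressions in $\mathbb{Z}$, relies only on $X \subseteq [N/4, N/2]$: if $x, y, z \in X$ satisfy $x + z \equiv 2y \pmod N$, then both $x + z$ and $2y$ lie in $[N/2, N]$, so they must be equal. The sketch below checks this by brute force in an illustrative case, taking $N = 101$ and (as a worst case) $X$ equal to the whole interval.

```python
def mod_N_progressions(X, N):
    """All (x, d) with d != 0 such that x, x + d, x + 2d lie in X modulo N."""
    members = set(X)
    return [(x, d) for x in X for d in range(1, N)
            if (x + d) % N in members and (x + 2 * d) % N in members]

N = 101                                             # illustrative prime modulus
X = [n for n in range(N) if N / 4 <= n <= N / 2]    # worst case: the full interval
triples = [(x, (x + d) % N, (x + 2 * d) % N) for x, d in mod_N_progressions(X, N)]
```

Every triple found this way satisfies $x + z = 2y$ as integers, so no wrap-around progressions occur.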

Remark. We have in fact shown that the number of triples $(p_1, p_2, p_3)$ of Chen primes with $p_1 + p_3 = 2p_2$ and with each $p_i \le N$ is $\gg N^2/\log^6 N$; it is possible to show from (1.8) that this bound is sharp up to an absolute multiplicative constant. It is clear that if one had a lower bound $\pi_2(N) \gg N/\log^2 N$ for the number of twin primes less than $N$ (this, of course, is one of the most famous open conjectures in prime number theory) then a simple adaptation of our argument would produce infinitely many triples $(p_1, p_2, p_3)$ of twin primes in arithmetic progression, and the number of such triples less than $N$ would be $\gg N^2/\log^6 N$. Similar remarks hold for any other constellation of primes.

7. Appendix: Construction of the enveloping sieve 

In this appendix we construct the enveloping sieve for Proposition 3.1 and verify its properties. Some of the material here is rather standard and will not be used directly in the rest of the paper, except via Proposition 3.1.

Let us recall the setup. We took a fixed polynomial $F$ and defined the "locally sieved" sets

$$X_q = \{n \in \mathbb{Z}_q : (q, F(n)) = 1\}.$$

We begin by analyzing these locally sieved sets $X_q$. From (1.2) we have

$$\gamma(q) = \mathbb{E}_{n \in \mathbb{Z}_q} 1_{(q,F(n))=1} = \frac{1}{q} |X_q|. \tag{7.1}$$

From the Chinese remainder theorem we see that $|X_{qq'}| = |X_q||X_{q'}|$ whenever $(q,q') = 1$, and also that $|X_{p^m}| = p^{m-1}|X_p|$ whenever $p^m$ is a prime power. From this and (7.1) we see that $\gamma$ is a multiplicative function, and indeed

$$\gamma(q) = \prod_{p | q} \gamma(p). \tag{7.2}$$

Observe that if $q | M$ then the projection $n \mapsto n \pmod q$ will map $X_M$ to $X_q$. Indeed, this map has uniform fibres:

Lemma 7.1. If $q | M$, then

$$\mathbb{E}_{n \in \mathbb{Z}_M,\ n \equiv m \ (\mathrm{mod}\ q)} 1_{X_M}(n) = \frac{\gamma(M)}{\gamma(q)} 1_{X_q}(m)$$

for all $m \in \mathbb{Z}_q$.

Proof. Let $r$ be the largest factor of $M$ which is coprime to $q$. We observe that if $n \pmod q = m$, then

$$1_{X_M}(n) = 1_{X_q}(m) \, 1_{X_r}(n \ (\mathrm{mod}\ r)).$$

From the Chinese remainder theorem we thus see that

$$\mathbb{E}_{n \in \mathbb{Z}_M,\ n \equiv m \ (\mathrm{mod}\ q)} 1_{X_M}(n) = 1_{X_q}(m) \, \mathbb{E}(1_{X_r}) = 1_{X_q}(m) \gamma(r)$$

and the claim follows from (7.2). □
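The multiplicativity (7.2) and the uniform-fibre property of Lemma 7.1 can be tested directly. The sketch below uses the illustrative polynomial $F(n) = n(n+2)$ (our choice for the example; any $F$ with $\gamma(p) > 0$ for all $p$ would do) and exact rational arithmetic.

```python
from fractions import Fraction
from math import gcd

def F(n):
    return n * (n + 2)          # illustrative polynomial, not fixed by the text

def X(q):
    """The locally sieved set X_q = {n in Z_q : (q, F(n)) = 1}."""
    return [n for n in range(q) if gcd(q, F(n)) == 1]

def gamma(q):
    """gamma(q) = |X_q| / q, as in (7.1)."""
    return Fraction(len(X(q)), q)

def fibre_average(M, q, m):
    """Average of 1_{X_M}(n) over n in Z_M with n = m (mod q)."""
    X_M = set(X(M))
    fibre = [n for n in range(M) if n % q == m]
    return Fraction(sum(1 for n in fibre if n in X_M), len(fibre))

M, q = 90, 6                    # illustrative moduli with q | M
X_q = set(X(q))
```

For instance $\gamma(2) = 1/2$, $\gamma(3) = 1/3$ and $\gamma(5) = 3/5$ for this $F$, and $\gamma(90) = \gamma(2)\gamma(3)\gamma(5)$ even though 90 is not square-free.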

We now analyze the Fourier behaviour of the $X_q$. For each $q \in \mathbb{N}$ and $a \in \mathbb{Z}$ we make the temporary definition

$$t(a,q) := \mathbb{E}_{n \in X_q} e_q(an) = \frac{1}{|X_q|} \sum_{n \in X_q} e_q(an). \tag{7.3}$$

Note that the denominator is non-zero by (1.3).

Lemma 7.2. If $q \in \mathbb{Z}^+$ and $a \in \mathbb{Z}_q^*$, then $t(a,q) = t(ak, qk)$ for any $k \in \mathbb{Z}^+$.

Proof. Let $k'$ be the largest factor of $k$ which is coprime to $q$. Observe that $n \in X_{qk}$ if and only if $n \pmod q \in X_q$ and $n \pmod{k'} \in X_{k'}$. Since $e_q(an)$ depends only on $n \pmod q$, we thus see from the Chinese remainder theorem that

$$\mathbb{E}_{n \in X_{qk}} e_q(an) = \mathbb{E}_{n \in X_q} e_q(an),$$

and the claim follows. □

As a consequence of this we may define the normalised Fourier coefficients $s(a/q)$ by

$$s(a/q) := t(a,q). \tag{7.4}$$

We now record some useful identities and estimates for $s(a/q)$.

Lemma 7.3. Let $q, q' \in \mathbb{Z}^+$ and $a, a' \in \mathbb{Z}$. Then the following statements hold.

(i) $s(0) = 1$.

(ii) $s$ is periodic modulo 1. In particular when considering $s(a/q)$, we may think of $a$ as an element of $\mathbb{Z}_q$.

(iii) If $(q,q') = 1$, then

$$s(a/q + a'/q') = s(a/q) s(a'/q').$$

(iv) If $a \in \mathbb{Z}_q^*$ and $q$ is not square-free, $s(a/q) = 0$.

(v) If $a \in \mathbb{Z}_q^*$, $q > 1$, and $\gamma(q) = 1$, then $s(a/q) = 0$.

(vi) If $a \in \mathbb{Z}_q^*$, then $|s(a/q)| \le \prod_{p | q} (1 - \gamma(p))/\gamma(p)$.

Proof. The first two claims are immediate from (7.4). To confirm (iii), define $b \pmod{qq'}$ by the congruences $b \equiv aq' \pmod q$ and $b \equiv a'q \pmod{q'}$, and $r, r'$ by the relations $qr \equiv 1 \pmod{q'}$, $q'r' \equiv 1 \pmod q$. Then it is easy to check that

$$s(a/q) s(a'/q') = \frac{1}{|X_q||X_{q'}|} \sum_{n \in X_q} \sum_{n' \in X_{q'}} e_{qq'}(b(q'r'n + qrn')). \tag{7.5}$$

Observe that $q'r'n + qrn'$ is congruent to $n \pmod q$ and to $n' \pmod{q'}$. Hence the right-hand side of (7.5) is indeed just

$$\frac{1}{|X_{qq'}|} \sum_{m \in X_{qq'}} e_{qq'}(bm) = s(b/qq') = s(a/q + a'/q').$$

To prove (iv), we write $q = p^2 k$ for some prime $p$ and integer $k$, and observe that $X_{p^2 k}$ is periodic mod $pk$. Since $a/q$ is not an integer divided by $pk$, we see that the expression in (7.4) necessarily vanishes. The fifth claim (v) follows by inspection. Finally, we prove (vi). By (iv) we may assume $q$ is square-free. By (iii) we may assume $q$ is prime. But then (vi) follows by writing

$$\sum_{n \in X_q} e_q(an) = \sum_{n \in \mathbb{Z}_q} e_q(an) - \sum_{n \in \mathbb{Z}_q \setminus X_q} e_q(an),$$

and observing that the former sum vanishes since $a \in \mathbb{Z}_q^*$, while the latter sum is at most $|\mathbb{Z}_q \setminus X_q| = q(1 - \gamma(q))$ by (7.1). □

Now we can write $1_{X_M}$ as a Fourier series, with coefficients determined by the $\gamma$ and $s$ functions.

Lemma 7.4. For any $M$ and any $n \in \mathbb{Z}_M$, we have

$$1_{X_M}(n) = \gamma(M) \sum_{q | M} \sum_{a \in \mathbb{Z}_q^*} s(a/q) e_q(-an). \tag{7.6}$$

Proof. The right-hand side can be rewritten as $\gamma(M) \sum_{b \in \mathbb{Z}_M} s(b/M) e_M(-bn)$. But from (7.1), (7.3) and (7.4) we have $\gamma(M) s(b/M) = \mathbb{E}_{n \in \mathbb{Z}_M} 1_{X_M}(n) e_M(bn)$. The claim then follows from the Fourier inversion formula. □
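Parts (iii), (iv) and (vi) of Lemma 7.3 and the Fourier series (7.6) lend themselves to a quick numerical sanity check. The sketch below again takes the illustrative polynomial $F(n) = n(n+2)$, which is our assumption for the example; the helper names are ours as well.

```python
import cmath
from math import gcd

def F(n):
    return n * (n + 2)              # illustrative polynomial

def X(q):
    return [n for n in range(q) if gcd(q, F(n)) == 1]

def gamma(q):
    return len(X(q)) / q

def e(a, q):
    return cmath.exp(2j * cmath.pi * a / q)

def s(a, q):
    """s(a/q) = E_{n in X_q} e_q(a n), as in (7.3) and (7.4)."""
    X_q = X(q)
    return sum(e(a * n, q) for n in X_q) / len(X_q)

def prime_factors(q):
    return [p for p in range(2, q + 1)
            if q % p == 0 and all(p % r for r in range(2, p))]

def bound_vi(q):
    """The right-hand side of Lemma 7.3 (vi)."""
    out = 1.0
    for p in prime_factors(q):
        out *= (1 - gamma(p)) / gamma(p)
    return out

def fourier_series(M, n):
    """The right-hand side of (7.6)."""
    total = sum(s(a, q) * e(-a * n, q)
                for q in range(1, M + 1) if M % q == 0
                for a in range(q) if gcd(a, q) == 1)
    return gamma(M) * total
```

In (iii) the rational $a/3 + a'/5$ equals $(5a + 3a')/15$, which is how the product formula is tested below.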

Motivated by the identity (7.6), let us define the function$^4$ $\alpha_R : \mathbb{Z} \to \mathbb{R}$ by

$$\alpha_R(n) := \sum_{q \le R} \sum_{a \in \mathbb{Z}_q^*} s(a/q) e_q(-an). \tag{7.7}$$

Thus $\alpha_R$ is some sort of truncated version of $1_{X_{R!}}/\gamma(R!)$, which is normalized to have average value 1 (as can be seen by considering the $q = 1$ term of the above sum, which is the only term which does not oscillate in $n$).

Lemma 7.5. Let $h$ denote the multiplicative function

$$h(q) := \mu(q)^2 \prod_{p | q} \frac{1 - \gamma(p)}{\gamma(p)}. \tag{7.8}$$

Then for all $n \in X_{R!}$, we have $\alpha_R(n) = G(R)$, where $G(R) = G(F,R)$ is the quantity

$$G(R) := \sum_{q \le R} h(q). \tag{7.9}$$

Remark. Note in particular that $G(R)$ is positive (the $q = 1$ summand is equal to 1).

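$^4$ While at first glance this expression appears to be complex-valued, it is in fact real, as one can see from conjugation symmetry, or from the more explicitly real formulae for $\alpha_R$ given below.

Proof. Fix $n \in X_{R!}$. Recall the definition (7.7) of $\alpha_R$. In the first instance we will work with the inner sum $\sum_{a \in \mathbb{Z}_q^*} s(a/q) e_q(-an)$ occurring in that definition. We may restrict $q$ to be square-free since only such $q$ contribute to the sum (7.7) in view of Lemma 7.3 (iv). We have

$$\begin{aligned}
\sum_{a \in \mathbb{Z}_q^*} s(a/q) e_q(-an)
&= \mathbb{E}_{m \in X_q} \sum_{a \in \mathbb{Z}_q^*} e_q(am) e_q(-an) \\
&= \mathbb{E}_{m \in X_q} \sum_{a \in \mathbb{Z}_q} e_q(a(m-n)) 1_{(a,q)=1} \\
&= \mathbb{E}_{m \in X_q} \sum_{a \in \mathbb{Z}_q} e_q(a(m-n)) \sum_{d | (a,q)} \mu(d) \\
&= \sum_{d | q} \mu(d) \, \mathbb{E}_{m \in X_q} \sum_{a \in \mathbb{Z}_q : d | a} e_q(a(m-n)) \\
&= \sum_{d | q} \mu(d) \frac{q}{d} \, \mathbb{E}_{m \in X_q} 1_{m \equiv n \ (\mathrm{mod}\ q/d)} \\
&= \sum_{d | q} \mu(d) \frac{\mathbb{E}_{m \in \mathbb{Z}_q,\ m \equiv n \ (\mathrm{mod}\ q/d)} 1_{X_q}(m)}{\mathbb{E}_{m \in \mathbb{Z}_q} 1_{X_q}(m)}.
\end{aligned}$$

By Lemma 7.1 we have

$$\frac{\mathbb{E}_{m \in \mathbb{Z}_q,\ m \equiv n \ (\mathrm{mod}\ q/d)} 1_{X_q}(m)}{\mathbb{E}_{m \in \mathbb{Z}_q} 1_{X_q}(m)} = \frac{1}{\gamma(q/d)} 1_{X_{q/d}}(n).$$

Now we substitute into (7.7), the definition of $\alpha_R$. Since $q$ is being supposed square-free, $d$ and $q/d$ are coprime. Hence

$$\alpha_R(n) = \sum_{\substack{q \le R \\ q \text{ square-free}}} \sum_{d | q} \frac{\mu(d)}{\gamma(q/d)} 1_{X_{q/d}}(n). \tag{7.10}$$

Since $n \in X_{R!}$, we have $(q, F(n)) = 1$ for all $q \le R$ and hence $1_{X_{q/d}}(n) = 1$. Furthermore from the multiplicativity of $\gamma$ we have

$$\sum_{d | q} \frac{\mu(d)}{\gamma(q/d)} = \prod_{p | q} \Big(\frac{1}{\gamma(p)} - 1\Big) = h(q)$$

for square-free $q$, and the claim follows. □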
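Lemma 7.5 can be checked directly from the definition (7.7) in the simplest case $F(n) = n$, where $X_q$ is the group of units modulo $q$, $\gamma(p) = 1 - 1/p$ and $h(q) = \mu(q)^2/\phi(q)$. The sketch below is a numerical illustration with the arbitrary choice $R = 6$, for which $G(6) = 1 + 1 + \tfrac12 + 0 + \tfrac14 + \tfrac12 = 3.25$.

```python
import cmath
from math import gcd

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0            # n has a square factor
            result = -result
        p += 1
    return -result if n > 1 else result

def phi(n):
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)

def s(a, q):
    """s(a/q) for F(n) = n: the average of e_q(a m) over units m mod q."""
    units = [m for m in range(q) if gcd(m, q) == 1]
    return sum(cmath.exp(2j * cmath.pi * a * m / q) for m in units) / len(units)

def alpha(R, n):
    """alpha_R(n) computed straight from definition (7.7)."""
    return sum(s(a, q) * cmath.exp(-2j * cmath.pi * a * n / q)
               for q in range(1, R + 1)
               for a in range(q) if gcd(a, q) == 1)

def G(R):
    """G(R) = sum_{q <= R} h(q) with h(q) = mu(q)^2 / phi(q), as in (7.9)."""
    return sum(mobius(q) ** 2 / phi(q) for q in range(1, R + 1))
```

Any $n$ with no prime factor at most $R$ lies in $X_{R!}$, so for $R = 6$ the values $n = 1, 7, 11, 49, 77$ should all give $\alpha_R(n) = G(R)$.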

The function $\alpha_R$ is not quite a candidate for an enveloping sieve, because it can be negative. To resolve this problem, we shall simply square (and renormalize) $\alpha_R$, defining

$$\beta_R(n) := \frac{1}{G(R)} |\alpha_R(n)|^2. \tag{7.11}$$

Clearly $\beta_R$ is non-negative, has $R!$ as a period, and equals $G(R)$ on $X_{R!}$. We now give a Fourier representation for $\beta_R$ similar to (7.7), but with slightly larger Fourier coefficients.

Proposition 7.1. [28] We have the representation

$$\beta_R(n) = \sum_{q \le R^2} \sum_{a \in \mathbb{Z}_q^*} w(a/q) e_q(-an),$$

where the coefficients $w(a/q)$ obey the bounds

$$|w(a/q)| \le 3^{\omega(q)} |s(a/q)| \tag{7.12}$$

(here $\omega(q)$ denotes the number of prime factors of $q$). Also we have $w(0) = 1$.

Remark. More precise asymptotics for $w(a/q)$ are available, see [28] (and see [27] for an even more precise statement in the case $F(n) = n$). We will not need these refinements here, however.

Proof. We follow the arguments of Ramaré and Ruzsa [28]. Our starting point is the formula (7.10), which was derived for all $n$ (not just those in $X_{R!}$). Writing $q = dr$, we thus have$^5$

$$\alpha_R(n) = \sum_{d,r : dr \le R} \frac{\mu(d)}{\gamma(r)} 1_{(r,F(n))=1}. \tag{7.13}$$

$^5$ Note that the contribution of the $q$ which are not square-free can be added in, since it is easy to show that $\sum_{dr = q} \frac{\mu(d)}{\gamma(r)} 1_{(r,F(n))=1} = 0$ for all such $q$. Indeed, if $p^2 | q$, then the contributions of the cases $p | d$, $p | r$ and $p \nmid d$, $p^2 | r$ to this sum will cancel each other out.

Inserting (7.13) into (7.11), we obtain

$$\beta_R(n) = \frac{1}{G(R)} \sum_{\substack{d_1,r_1,d_2,r_2 \\ d_1 r_1, d_2 r_2 \le R}} \frac{\mu(d_1)\mu(d_2)}{\gamma(r_1)\gamma(r_2)} 1_{([r_1,r_2],F(n))=1},$$

where $[r_1,r_2]$ denotes the least common multiple of $r_1$ and $r_2$. Also note that $\alpha_R$, and hence $\beta_R$, is periodic with period $R!$. By Fourier inversion, we then have

$$\beta_R(n) = \sum_{q | R!} \sum_{a \in \mathbb{Z}_q^*} w(a/q) e_q(-an)$$

where

$$w(a/q) := \frac{1}{G(R)} \sum_{\substack{d_1,r_1,d_2,r_2 \\ d_1 r_1, d_2 r_2 \le R}} \frac{\mu(d_1)\mu(d_2)}{\gamma(r_1)\gamma(r_2)} \mathbb{E}_{n \in \mathbb{Z}_{R!}} 1_{([r_1,r_2],F(n))=1} e_q(an).$$

Fix $a, q$. Observe that the quantity $1_{([r_1,r_2],F(n))=1}$, as a function of $n$, is periodic with period $[r_1,r_2]$. Thus $\mathbb{E}_{n \in \mathbb{Z}_{R!}} 1_{([r_1,r_2],F(n))=1} e_q(an)$ vanishes unless $q$ divides $[r_1,r_2]$, in which case the expression is equal to $s(a/q)\gamma([r_1,r_2])$ by (7.4). In particular, since $[r_1,r_2] \le R^2$, we know that $w(a/q)$ vanishes when $q > R^2$, and in the remaining cases $q \le R^2$ we have

$$w(a/q) = \frac{s(a/q)}{G(R)} \sum_{\substack{d_1,r_1,d_2,r_2 \\ d_1 r_1, d_2 r_2 \le R \\ q | [r_1,r_2]}} \mu(d_1)\mu(d_2) \frac{\gamma([r_1,r_2])}{\gamma(r_1)\gamma(r_2)}.$$

From (7.2) and (7.8) we have

$$\frac{\gamma([r_1,r_2])}{\gamma(r_1)\gamma(r_2)} = \prod_{p | (r_1,r_2)} \frac{1}{\gamma(p)} = \prod_{p | (r_1,r_2)} (1 + h(p)) = \sum_{t | (r_1,r_2)} h(t)$$

and thus

$$w(a/q) = \frac{s(a/q)}{G(R)} \sum_{t \le R} h(t) \sum_{\substack{d_1,r_1,d_2,r_2 \\ d_1 r_1, d_2 r_2 \le R \\ q | [r_1,r_2] \\ t | (r_1,r_2)}} \mu(d_1)\mu(d_2). \tag{7.14}$$

Thus to prove (7.12), it suffices by (7.9) to show that

$$\Big| \sum_{\substack{d_1,r_1,d_2,r_2 \\ d_1 r_1, d_2 r_2 \le R \\ q | [r_1,r_2] \\ t | (r_1,r_2)}} \mu(d_1)\mu(d_2) \Big| \le 3^{\omega(q)}$$

for all $t \le R$. While this is in fact true for all $q$, it suffices in light of Lemma 7.3 (iv) to verify it for square-free $q$, in which case the formulae are slightly simpler.

Fix $t$. We take advantage of the constraint $t | (r_1,r_2)$ to write $r_j = t\tilde r_j$, $l_j = d_j \tilde r_j$, and $\tilde q := q/(q,t)$, and observe that the constraint $q | [r_1,r_2]$ is equivalent to $\tilde q | [\tilde r_1,\tilde r_2]$. Also, $\tilde q$ is square-free. Thus we reduce to proving that

$$\Big| \sum_{l_1,l_2 \le R/t} A(l_1,l_2) \Big| \le 3^{\omega(\tilde q)} \tag{7.15}$$

where

$$A(l_1,l_2) := \sum_{\substack{\tilde r_1 | l_1,\ \tilde r_2 | l_2 \\ \tilde q | [\tilde r_1,\tilde r_2]}} \mu(l_1/\tilde r_1) \mu(l_2/\tilde r_2).$$

This expression can be worked out quite explicitly:

Lemma 7.6. Let $\tilde q$ be square-free. Then the quantity $A(l_1,l_2)$ vanishes unless $l_1, l_2 | \tilde q$ and $\tilde q | l_1 l_2$, in which case $|A(l_1,l_2)| = 1$.

Proof. It suffices to prove this claim in the "local" case when $l_1, l_2, \tilde q$ are all powers of a single prime $p$, since the general case then follows by splitting $l_1, l_2, \tilde q$ into prime factors and exploiting the multiplicativity of the summation in $A(l_1,l_2)$. If $\tilde q = 1$ then the constraint $\tilde q | [\tilde r_1,\tilde r_2]$ is vacuous and the claim follows from Möbius inversion. Since $\tilde q$ is square-free, the only remaining case is when $\tilde q = p$. In this case, $\tilde q | [\tilde r_1,\tilde r_2]$ if and only if not both of $\tilde r_1, \tilde r_2$ equal one, and so

$$A(l_1,l_2) = \sum_{\tilde r_1 | l_1,\ \tilde r_2 | l_2} \mu(l_1/\tilde r_1)\mu(l_2/\tilde r_2) - \mu(l_1)\mu(l_2).$$

Applying Möbius inversion (or direct computation) we thus see that the expression $A(l_1,l_2)$ equals $-1$ when $(l_1,l_2) = (p,p)$, equals $+1$ when $(l_1,l_2) = (p,1), (1,p)$, and vanishes otherwise, and the claim follows. □
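Lemma 7.6 is a finite combinatorial statement and can be verified exhaustively for small parameters. The sketch below evaluates $A(l_1, l_2)$ from its definition for the illustrative square-free moduli $\tilde q \in \{1, 2, 6, 30\}$ and all $l_1, l_2 \le 60$; the function and variable names are ours.

```python
from math import gcd

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def lcm(a, b):
    return a * b // gcd(a, b)

def A(l1, l2, q):
    """Sum of mu(l1/r1) mu(l2/r2) over r1 | l1, r2 | l2 with q | [r1, r2]."""
    return sum(mobius(l1 // r1) * mobius(l2 // r2)
               for r1 in divisors(l1) for r2 in divisors(l2)
               if lcm(r1, r2) % q == 0)

def support(q, bound):
    """Pairs (l1, l2) with 1 <= l1, l2 <= bound at which A is non-zero."""
    return [(l1, l2) for l1 in range(1, bound + 1) for l2 in range(1, bound + 1)
            if A(l1, l2, q) != 0]
```

For each prime $p | \tilde q$ there are exactly three admissible local patterns, $(p,p)$, $(p,1)$ and $(1,p)$, which is the source of the factor $3^{\omega(\tilde q)}$ in (7.15).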

From this lemma we see that there are at most $3^{\omega(\tilde q)} \le 3^{\omega(q)}$ pairs $(l_1,l_2)$ for which $A(l_1,l_2)$ is non-zero, and in each of these cases we have $|A(l_1,l_2)| = 1$. The claim (7.15) follows. Using (7.14), this proves (7.12).

In the case $a = 0$ and $q = 1$, a closer inspection of the above argument shows that the quantity in absolute values in (7.15) is in fact exactly equal to 1. In view of (7.14), Lemma 7.3 (i) and (7.9), this gives $w(0) = 1$ as desired. This concludes the proof of Proposition 7.1. □

Finally, we rewrite $\alpha_R$ (and hence $\beta_R$) in the more familiar notation of the Selberg sieve. This will allow us to conclude the crude estimate for $\beta_R$ demanded in Proposition 3.1 (ii).

Lemma 7.7. We have

$$\alpha_R(n) = G(R) \sum_{\substack{d \le R \\ d | F(n)}} \lambda_d,$$

where $\lambda_d$ is defined for squarefree $d$ by

$$\lambda_d := \frac{\mu(d) G_d(R/d)}{\gamma(d) G(R)}, \tag{7.16}$$

and the quantity $G_d$ is defined by

$$G_d(x) := \sum_{\substack{q \le x \\ (q,d) = 1}} h(q).$$

Remark. The notation here is reasonably consistent with what is normal in sieve theory, as may be found for example in [19]. For some further discussion of the weights in the special case $F(n) = n$, see Appendix 8 below.

Proof. We start with (7.13) and use Möbius inversion to write

$$1_{(r,F(n))=1} = \sum_{m | r,\ m | F(n)} \mu(m),$$

and hence

$$\alpha_R(n) = \sum_{m \le R : m | F(n)} \mu(m) \sum_{s,r : sr \le R,\ m | r} \frac{\mu(s)}{\gamma(r)}.$$

Writing $r =: mr'$ and $q := sr'$, we thus have

$$\alpha_R(n) = \sum_{m \le R : m | F(n)} \mu(m) \sum_{q \le R/m} \sum_{r' | q} \frac{\mu(q/r')}{\gamma(mr')}.$$

From (7.2) a calculation gives

$$\sum_{r' | q} \frac{\mu(q/r')}{\gamma(mr')} = \frac{\mu(q)^2 1_{(q,m)=1}}{\gamma(m)} \prod_{p | q} \frac{1 - \gamma(p)}{\gamma(p)} = \frac{h(q) 1_{(q,m)=1}}{\gamma(m)}$$

and the claim follows. □

Lemma 7.8. Suppose that $d$ is squarefree. Then $|\lambda_d| \le 1$.

Proof. This is a well-known fact in sieve theory. The proof (which may be found in [19, p. 100]) consists in observing that

$$G(R) = \sum_{l | d} h(l) G_d(R/l) \ge \Big(\sum_{l | d} h(l)\Big) G_d(R/d) = \frac{G_d(R/d)}{\gamma(d)},$$

which implies the result immediately. □
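As an illustration of Lemmas 7.7 and 7.8, the weights $\lambda_d$ can be computed exactly in the special case $F(n) = n$, where $\gamma(d) = \phi(d)/d$ and $h(q) = \mu(q)^2/\phi(q)$. The sketch below is an example under that assumption; the helper names are ours, and the condition $q \le R/d$ is implemented as $qd \le R$ to stay in integer arithmetic.

```python
from fractions import Fraction
from math import gcd

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def phi(n):
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)

def h(q):
    return Fraction(mobius(q) ** 2, phi(q))

def G_d(d, R):
    """G_d(R/d): sum of h(q) over q <= R/d with (q, d) = 1."""
    return sum(h(q) for q in range(1, R // d + 1) if gcd(q, d) == 1)

def lam(d, R):
    """Selberg weight (7.16) for F(n) = n, i.e. gamma(d) = phi(d)/d."""
    return mobius(d) * Fraction(d, phi(d)) * G_d(d, R) / G_d(1, R)

def alpha_selberg(R, n):
    """alpha_R(n) via Lemma 7.7: G(R) times the sum of lambda_d over d <= R, d | n."""
    return G_d(1, R) * sum(lam(d, R) for d in range(1, R + 1) if n % d == 0)
```

For example, with $R = 3$ one finds $\lambda_1 = 1$, $\lambda_2 = -4/5$, $\lambda_3 = -3/5$, and $\alpha_3(2) = \tfrac52(1 - \tfrac45) = \tfrac12$, in agreement with a direct evaluation of (7.7).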

Proof of Proposition 3.1. To prove (i), it suffices in view of Lemma 7.5 to show that

$$G(R) \gg_k \mathfrak{S}_F^{-1} \log^k R.$$

In [19, Lemma 4.1], the bound

$$G(R) \gg_k \prod_{p < R} \frac{1}{\gamma(p)}$$

is obtained. The claim (i) now follows from (1.7) and the classical bound $\prod_{k < p \le R} (1 - k/p) \ll_k (\log R)^{-k}$.

Let us now consider (ii), that is the statement $\beta_R(n) \ll_\varepsilon N^\varepsilon$. By Lemmas 7.7 and 7.8 we have the bound $\beta_R(n) \le G(R) d(F(n))^2$. Recall that $1 - \gamma(p) \le k/p$ for all $p$ (cf. (1.4)), together with the non-degeneracy assumption (1.3), which implies that $\gamma(p) \ge 1/p$. For any $\varepsilon > 0$, we may therefore infer the bound
(7.17) 

%) = ^? n ^# < kk ^ 2 n «* kMq) w- «/« «m <r 

p\q p\q 

p>k 

for any $\varepsilon > 0$. Here, we have invoked well-known inequalities for the divisor function (recall that $|a_i|, |b_i| \le N$, whence $|F(n)| \ll_k N^{2k}$). Using (7.17), and recalling (7.9), we infer that

$$G(R) \ll_\varepsilon R^{\varepsilon/2} \le N^{\varepsilon/2},$$

concluding the proof of (ii). We remark that more precise bounds on G(R) 
may be found in [TT?] . 

To prove (iii) and (iv), it suffices by Lemma 7.3 (vi) and Proposition 7.1 to show that

$$3^{\omega(q)} \prod_{p | q} (1 - \gamma(p))/\gamma(p) \ll_\varepsilon q^{\varepsilon - 1}$$

for all square-free $q$. But the left-hand side is just $3^{\omega(q)} h(q)$, whence by (7.17) and the classical bound $\omega(q) \ll \log q/\log\log q$ we are done. □

8. Appendix: A comparison of two enveloping sieves. 

Let us begin with a few further remarks concerning what we mean by an enveloping sieve. The idea of considering the sieve $\beta_R$ not simply as a tool for obtaining upper bounds but as an interesting function in its own right is one we got by reading papers of Ramaré [27] and Ramaré-Ruzsa [28]. Ramaré generously acknowledges that the essential idea goes back to Hooley [20]. It is in those papers (and in fact so far as we know only in those papers) that one finds the term enveloping sieve.

The name, though perhaps nonstandard, seems to us to be appropriate for describing a situation where one uses a single majorant for the primes to deal simultaneously with a whole family of problems which might normally, by themselves, be dealt with by a family of different sieves.

In Proposition 3.1 take $F(n) = n$, so that $G(R)^{-1}\beta_R$, as defined in (7.11), is a majorant for the primes themselves (or at least the primes $> R$). The expansion of $\beta_R$ as a Fourier series in Proposition 3.1 can be thought of as a way of using a sieve for the primes to tell us about the behaviour of the primes restricted to arithmetic progressions $a \pmod q$. This might normally be handled by a separate procedure for each $(a,q)$.

The authors [18] used a somewhat different majorant for the primes, which we call $\beta'_R$ below (a closely related object was called $\nu$ in [18]). We were concerned with the estimation of correlations

$$\mathbb{E}_{n \le N} \beta'_R(n + h_1) \cdots \beta'_R(n + h_m). \tag{8.1}$$

Here, then, a single sieve $\beta'_R$ is being used to deal with configurations $(n + h_1, \ldots, n + h_m)$ which might more normally be treated using a Selberg sieve adapted to the polynomial $F(n) = (n + h_1) \cdots (n + h_m)$. This, we think, should also qualify as a use of an enveloping sieve.

Let us return to the function $\beta_R(n)$. By Lemma 7.7 it is given by the formula

$$\beta_R(n) = G(R) \Big( \sum_{\substack{d | n \\ d \le R}} \lambda_d^{SEL} \Big)^2.$$

$G(R)$ is as defined in (7.9). In this particular case we have from (7.8) that $h(m) = 1/\phi(m)$ for square-free $m$ and so

$$G(R) = \sum_{m \le R} \frac{\mu^2(m)}{\phi(m)}.$$

The weights $\lambda_d^{SEL}$ (SEL stands for Selberg) are defined by (7.16); when $F(n) = n$ we have $\gamma(d) = \phi(d)/d$, and thus

$$\lambda_d^{SEL} = \frac{\mu(d) \, d \, G_d(R/d)}{\phi(d) G(R)}. \tag{8.2}$$

The Fourier transform of $\beta_R$ could be described quite accurately, as was done for example in Proposition 7.1. There are many other uses for $\beta_R$, one of the most famous being to the so-called Brun-Titchmarsh problem of proving an upper bound for the number of primes in an interval $[x, x + y)$. The function $\beta_R$ and the associated weights $\lambda_d^{SEL}$ were first considered by Selberg and constitute the simplest instance of Selberg's upper bound sieve (also known as the $\Lambda^2$-sieve).

As is well-known, the weights $\lambda_d^{SEL}$ arise from a certain natural quadratic optimization problem. Indeed for any real weights $\lambda_d$ such that $\lambda_1 = 1$ the function

$$\psi(n) := \Big( \sum_{\substack{d | n \\ d \le R}} \lambda_d \Big)^2$$

forms a pointwise majorant for the primes $> R$. For a given integer $N$ we might choose $R$ to be a small power of $N$ and then attempt to choose the $\lambda_d$ so that $\sum_{n \le N} \psi(n)$ is as small as possible. After changing the order of summation one finds that

$$\sum_{n \le N} \psi(n) = N \sum_{d,d' \le R} \frac{\lambda_d \lambda_{d'}}{[d,d']} + O(R^2 \sup_{d \le R} \lambda_d^2). \tag{8.3}$$

In practice the error will be small, and we become interested in minimising the quadratic form

$$Q(\lambda) := \sum_{d,d' \le R} \frac{\lambda_d \lambda_{d'}}{[d,d']}$$

subject to $\lambda_1 = 1$. $Q(\lambda)$ may be written as $\sum_{\delta \le R} \phi(\delta) u_\delta^2$ where

$$u_\delta := \sum_{\delta | d} \lambda_d/d; \tag{8.4}$$

in these new coordinates the condition $\lambda_1 = 1$ translates to

$$\sum_{\delta \le R} \mu(\delta) u_\delta = 1.$$

Observe that under these conditions we have

$$Q(\lambda) = \sum_{\delta \le R} \phi(\delta) \Big( u_\delta - \frac{\mu(\delta)}{\phi(\delta) G(R)} \Big)^2 + G(R)^{-1}, \tag{8.5}$$

which makes it completely clear that the minimum of $Q(\lambda)$ is $1/G(R)$ and that this occurs when $u_\delta = \mu(\delta)/\phi(\delta) G(R)$. This corresponds to the choice

$$\lambda_d = \lambda_d^{SEL}.$$
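The optimisation statement can be confirmed in exact arithmetic for small $R$. The sketch below, in which the value $R = 10$ and the perturbation are arbitrary illustrative choices and the helper names are ours, evaluates the quadratic form $Q$ at the Selberg weights (8.2) and at a perturbed weight vector that still satisfies $\lambda_1 = 1$.

```python
from fractions import Fraction
from math import gcd

def mobius(n):
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def phi(n):
    return sum(1 for a in range(1, n + 1) if gcd(a, n) == 1)

def G_d(d, R):
    """G_d(R/d) = sum of mu(q)^2 / phi(q) over q <= R/d with (q, d) = 1."""
    return sum(Fraction(mobius(q) ** 2, phi(q))
               for q in range(1, R // d + 1) if gcd(q, d) == 1)

def selberg_weights(R):
    """The weights (8.2); they vanish automatically when d is not square-free."""
    G = G_d(1, R)
    return {d: mobius(d) * Fraction(d, phi(d)) * G_d(d, R) / G
            for d in range(1, R + 1)}

def Q(lam, R):
    """Q(lambda) = sum over d, d' <= R of lambda_d lambda_d' / [d, d']."""
    return sum(lam[d] * lam[e] * Fraction(gcd(d, e), d * e)
               for d in range(1, R + 1) for e in range(1, R + 1))

R = 10                                   # illustrative sieve level
lam_sel = selberg_weights(R)
lam_other = dict(lam_sel)
lam_other[2] += Fraction(1, 3)           # arbitrary perturbation keeping lambda_1 = 1
```

Here $1/[d,d']$ is written as $(d,d')/(dd')$. For $R = 2$ one can check by hand that $\lambda_1 = 1$, $\lambda_2 = -1$ and $Q = 1 - 1 + \tfrac12 = 1/G(2)$.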



Now it is well-known that $G(R) \sim \log R$. In fact one has the asymptotic

$$G(R) = \log R + \gamma + \sum_p \frac{\log p}{p(p-1)} + o(1). \tag{8.6}$$

This may be found in j^Tj, for example; Montgomery [23] traces the result 
back at least as far as Ward [3*7| . 

We remark that all this together with Lemma 7.8 (which, in our new notation, tells us that $|\lambda_d^{SEL}| \le 1$) allows us to recover the fact that if $R = N^{1/2 - \varepsilon}$ then

$$\sum_{n \le N} \beta_R(n) = N(1 + o_\varepsilon(1)).$$

This also follows quickly from Proposition 3.1 (cf. Lemma 4.1).

As remarked earlier, we had occasion in our paper [18, Chapter 8] to consider a somewhat different majorant for the primes $> R$. To enable a comparison with $\beta_R$, we write this majorant in the form

$$\beta'_R(n) = G(R) \Big( \sum_{\substack{d | n \\ d \le R}} \lambda_d^{GY} \Big)^2 \tag{8.7}$$

where

$$\lambda_d^{GY} := \frac{\mu(d) \log(R/d)}{\log R}.$$

Objects of this type were extensively analysed by Goldston and Yildirim 
1121 I13j . and in particular they saw how to asymptotically evaluate 
certain correlations of the form 1)8.1(1 provided R ^ N Cm . This was a 
crucial ingredient in our work in JH]) but we should remark that other 
aspects of the function (3' R were investigated much earlier, and indeed j3' R 
was known to Selberg. It has also featured in works of Friedlander-Goldston 
[S], Goldston ^U] and Hooley [2J_, among others. The consideration of the 
weights A GY and the associated majorant (3' R may be motivated by looking 
at an asymptotic for the weights A^ EL . In |271 Lemma 3.4], for example, 
one finds the result 

(8.8) G d (R/d) = \og(R/d) +7 + E -^tt +£— > 

^p(p-l) ^ p 

which leads in view of the definition (8.2) and (8.6) to the asymptotic

$$\lambda_d^{SEL} \approx \frac{\mu(d) \log(R/d)}{\log R} = \lambda_d^{GY}.$$

At the very least, this suggests looking at $\beta'_R$ as defined in (8.7) above. The oscillatory effect of the Möbius function means that there is no a priori guarantee that $\beta'_R$ is a close approximation to $\beta_R$. It turns out however that $\beta_R$ and $\beta'_R$ are, for many purposes, rather similar.

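One way to see this is to inspect the quadratic form $Q(\lambda^{GY})$. The asymptotic

$$Q(\lambda^{GY}) = \frac{1}{\log R} + O\Big(\frac{1}{\log^2 R}\Big)$$

follows from a result of Graham [15] and can easily be proved using Goldston and Yildirim's method (as exposited, for example, in [18, Ch. 9]). Comparing with (8.5) and using the asymptotic (8.6) one sees that

$$\sum_{\delta \le R} \phi(\delta) (u_\delta^{GY} - u_\delta^{SEL})^2 = O\Big(\frac{1}{\log^2 R}\Big).$$

Here, of course, $u^{GY}$ and $u^{SEL}$ are related to $\lambda^{GY}$ and $\lambda^{SEL}$ respectively by the transformation (8.4). If $R \ll N^{1/2 - \varepsilon}$ then this translates via (8.3) to a bound

$$\sum_{n \le N} \Big( \sum_{\substack{d | n \\ d \le R}} (\lambda_d^{GY} - \lambda_d^{SEL}) \Big)^2 = O\Big(\frac{N}{\log^2 R}\Big),$$

which implies that

$$\mathbb{E}_{n \le N} \big(\beta_R'^{1/2}(n) - \beta_R^{1/2}(n)\big)^2 = O\Big(\frac{1}{\log R}\Big).$$

Using the Cauchy-Schwarz inequality we now get

$$\begin{aligned}
\|\beta'_R - \beta_R\|_1 &:= \mathbb{E}_{n \le N} |\beta'_R(n) - \beta_R(n)| \\
&\le \Big(\mathbb{E}_{n \le N} \big(\beta_R'^{1/2}(n) - \beta_R^{1/2}(n)\big)^2\Big)^{1/2} \Big(\mathbb{E}_{n \le N} \big(\beta_R'^{1/2}(n) + \beta_R^{1/2}(n)\big)^2\Big)^{1/2} \\
&= O\Big(\frac{1}{\log R}\Big)^{1/2} \cdot \big(\|\beta'_R\|_1 + \|\beta_R\|_1\big)^{1/2}.
\end{aligned}$$

On the other hand, from Lemma 4.1 we have $\|\beta_R\|_1 = O(1)$. From the triangle inequality we thus obtain

Lemma 8.1. Suppose that $R \ll N^{1/2 - \varepsilon}$. Then $\beta_R$ and $\beta'_R$ are close in that they satisfy the $L^1$-estimate $\|\beta'_R - \beta_R\|_1 = O((\log R)^{-1/2})$. □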

Such an estimate is not, in itself, enough to allow us to deduce enough in- 
formation about the Fourier expansion of (3' R to prove a restriction theorem 
analogous to that obtained in fJUfor @r. Such Fourier information could, 
we believe, be obtained by an analytic method similar to that in but 
there seems little use for such a result. Going in the other direction, one 
might ask whether it is possible to obtain combinatorial information concerning correlations of the form (8.1) for $\beta_R$. In view of (8.8) it looks as though any such effort might require one to address the function

$$H_R(n) := \sum_{\substack{d | n \\ d \le R}} \mu(d),$$

concerning oneself in particular with the $2m$th moment of this function. When $m = 1$ this was investigated by Dress, Iwaniec and Tenenbaum [7] and when $m = 2$ by Motohashi. At this point there are already some rather thorny issues involved, and it does not seem worth the effort of pursuing the matter simply to obtain correlation estimates for $\beta_R$. The function $H_R$ is of some interest in its own right, however, and had an auxiliary role for instance in the ground-breaking work [Uj.

Let us conclude with the following remark. As far as enveloping sieves for the primes are concerned, it seems that both $\beta_R$ and $\beta'_R$ have their own role to play. $\beta_R$ is useful if one wants to do harmonic analysis, whereas $\beta'_R$ is far more appropriate if one wishes to do combinatorics. The two majorants are, however, rather closely related.